AD A043012


QUESTIONNAIRE  CONSTRUCTION  MANUAL 


ANNEX 

LITERATURE SURVEY AND BIBLIOGRAPHY


FORT  HOOD  FIELD  UNIT 




Research  Institute  for  the  Behavioral  and  Social  Sciences 


July  1976 

Approved for public release; distribution unlimited.


DISCLAIMER


THIS DOCUMENT IS BEST
QUALITY  AVAILABLE.  THE  COPY 
FURNISHED  TO  DTIC  CONTAINED 
A SIGNIFICANT  NUMBER  OF 
PAGES  WHICH  DO  NOT 
REPRODUCE  LEGIBLY. 


U.  S.  ARMY  RESEARCH  INSTITUTE 

FOR  THE  BEHAVIORAL  AND  SOCIAL  SCIENCES 

A Field  Operating  Agency  under  the  Jurisdiction  of  the 
Deputy  Chief  of  Staff  for  Personnel 


J. E. UHLANER
Technical  Director 


W.  C.  MAUS 
COL,  CS 
Commander 


Research  accomplished  under  contract 
to  the  Department  of  the  Army 


Operations  Research  Associates 




NOTICES 


DISTRIBUTION: Primary distribution of this report has been made by ARI.
Please address correspondence concerning distribution of reports to:
U. S. Army Research Institute for the Behavioral and Social Sciences,
ATTN: PERI-P, 1300 Wilson Boulevard, Arlington, Virginia 22209.

FINAL DISPOSITION: This report may be destroyed when it is no longer
needed. Please do not return it to the U. S. Army Research Institute for
the Behavioral and Social Sciences.

NOTE: The findings in this report are not to be construed as an official
Department of the Army position, unless so designated by other authorized
documents.



Unclassified 


SECURITY CLASSIFICATION OF THIS PAGE (When Data Entered)


REPORT DOCUMENTATION PAGE
(READ INSTRUCTIONS BEFORE COMPLETING FORM)

1.   REPORT NUMBER:

2.   GOVT ACCESSION NO.:

3.   RECIPIENT'S CATALOG NUMBER:

4.   TITLE (and Subtitle): QUESTIONNAIRE CONSTRUCTION MANUAL ANNEX.
     LITERATURE SURVEY AND BIBLIOGRAPHY

5.   TYPE OF REPORT & PERIOD COVERED:

6.   PERFORMING ORG. REPORT NUMBER:

7.   AUTHOR(s): Robert Dyer, J. J. Matthews, J. F. Stulac, C. E. Wright,
     and Kenneth Yudowitch

8.   CONTRACT OR GRANT NUMBER(s): DAHc"iy-7L-C-j6052

9.   PERFORMING ORGANIZATION NAME AND ADDRESS:
     Operations Research Associates
     Palo Alto, California

10.  PROGRAM ELEMENT, PROJECT, TASK AREA & WORK UNIT NUMBERS:
     20765731A775

11.  CONTROLLING OFFICE NAME AND ADDRESS:
     TRADOC Combined Arms Test Activity
     Fort Hood, Texas

12.  REPORT DATE: July 1976

13.  NUMBER OF PAGES: 426

14.  MONITORING AGENCY NAME & ADDRESS (if different from Controlling Office):
     U.S. Army Research Institute for the Behavioral and Social Sciences
     ARI Field Unit-Fort Hood, HQ TCATA (PERI-OH)
     Fort Hood, Texas

15.  SECURITY CLASS. (of this report): Unclassified

15a. DECLASSIFICATION/DOWNGRADING SCHEDULE:

16.  DISTRIBUTION STATEMENT (of this Report):
     Approved for public release; distribution unlimited

17.  DISTRIBUTION STATEMENT (of the abstract entered in Block 20, if
     different from Report):

18.  SUPPLEMENTARY NOTES:
     Contracting Officer's Technical Representative was George M. Gividen,
     Chief, ARI Field Office at Fort Hood, Texas. Companion volume is
     "Questionnaire Construction Manual."

19.  KEY WORDS (Continue on reverse side if necessary and identify by
     block number):
     Questionnaire construction
     Bibliography of test construction methods
     Test development
     Item development
     Questionnaire administration

20.  ABSTRACT (Continue on reverse side if necessary and identify by
     block number):

A literature survey on questionnaire construction encompassed journals,
books, and reports in psychology, sociology, education, and marketing, and
documentation published by the Defense Department. The search yielded over
2,000 references, which are listed in the 279-page annotated bibliography.

A synthesis of the findings is based on abstracts selected for their relevance
from the references. Results of the search were the basis for developing a
manual on questionnaire construction designed for Army personnel responsible
for field evaluations.


DD FORM 1473  EDITION OF 1 NOV 65 IS OBSOLETE


Unclassified 


SECURITY CLASSIFICATION OF THIS PAGE (When Data Entered)




QUESTIONNAIRE  CONSTRUCTION  LITERATURE  SURVEY 


BRIEF 


Requirement:

To  survey  and  review  the  literature  on  the  design  and  construction  of 
questionnaires,  seeking  to  synthesize  structural  and  procedural  improvements 
and to identify gaps in our knowledge of the effects on responses to questions
of  various  factors. 


Procedure: 

The  literature  search  encompassed  journals,  books,  and  reports  in  the  fields 
of  psychology,  sociology,  education,  marketing,  and  documentation  published 
by  the  Department  of  Defense.  Both  "hand"  and  computer  searches  were  made. 

Findings: 

The literature search yielded over 2,000 references on questionnaire methods.
Abstracts were available or prepared by the contractor for about 1,000 articles.
The findings are organized in twelve chapters and are followed by a 279-page
bibliography.


Utilization: 

The  content  of  this  literature  survey  may  be  of  greatest  interest  and 
value  to  full-time  or  professional  researchers  who  employ  questionnaires  in 
their  researches.  However,  a significant  portion  of  the  content  has 
entered  into  the  writing  of  an  ARI  instructional  manual  on  the  construction 
of  questionnaires.  The  manual  was  prepared  for  use  by  personnel  charged 
with the development of questionnaires for use in Army field tests and
evaluations.




TABLE OF CONTENTS


Chapter


I     INTRODUCTION

II    ADVANTAGES AND DISADVANTAGES OF VARIOUS TYPES OF QUESTIONNAIRES

      Methods to Measure Attributes and Behavior
      Comparison of the Structured Interview and Mail Questionnaires
      Comparison of the Structured Interview and Other Questionnaires
      Comparison of Open- and Closed-Ended Items
      Conclusions

III   SELECTION OF QUESTIONNAIRE ITEMS

      Content of Questionnaire Items
      Methods for Determining Questionnaire Content
      Other Considerations Related to Questionnaire Content
      Pros and Cons of Various Types of Questionnaire Items
      Ranking Items
      Rating Scale Items
      Multiple Choice Items
      Forced Choice and Paired Comparison Items
      Card Sorts
      Semantic Differential Items
      Other Types of Items
      Conclusions Regarding the Pros and Cons of Various Types of
        Questionnaire Items

IV    COMPARISON OF SCALING TECHNIQUES

V     EFFECTS OF VARIATION IN PRESENTATION OF QUESTIONNAIRE ITEMS

      Mode of Items
      Wording of Items
      Clarity of Items
      Difficulty of Items
      Length of Question Stem
      Order of Question Stems
      Order of Response Alternatives

VI    NUMBER OF RESPONSE ALTERNATIVES AND RESPONSE ANCHORING

      Issues Regarding Number of Response Alternatives to Employ
      Response Anchoring

VII   ORDER OF PERCEIVED FAVORABLENESS OF COMMONLY USED WORDS AND PHRASES

      Major Studies and Lists of Adjectives and Scale Values
      Summary and Conclusions



VIII  CONSIDERATIONS RELATED TO THE PHYSICAL CHARACTERISTICS OF
      QUESTIONNAIRES

      Location of Response Alternatives Relative to Stem
      Questionnaire Length
      Questionnaire Format Considerations
      The Use of Answer Sheets

IX    CONSIDERATIONS RELATED TO THE ADMINISTRATION OF QUESTIONNAIRES

      Effects of Instructions
      Effects of Various Motivational Factors
      Effects of Anonymity
      Effects of Administration Time
      Effects of Characteristics of Questionnaire Administrators
      Effects of Administration Conditions
      Effects of Other Factors Related to Questionnaire Administration

X     CHARACTERISTICS OF RESPONDENTS THAT INFLUENCE QUESTIONNAIRE RESULTS

      Item Format Biases
      Social Desirability Response Set
      Acquiescence Response Set
      Extreme Response Set
      Effects of Attitudes on Responses
      Effects of Demographic Characteristics on Responses
      Summary and Conclusions

XI    CONSIDERATIONS RELATED TO THE EVALUATION OF QUESTIONNAIRE RESULTS

      Scoring of Questionnaire Results
      Properties and Uses of Ipsative Scores
      Data Analyses

XII   RECOMMENDED AREAS FOR FURTHER RESEARCH

      Advantages and Disadvantages of Various Types of Questionnaires
      Selection of Questionnaire Items to be Used
      Comparison of Scaling Techniques
      Effects of Variation in Presentation of Questionnaire Items
      Number of Response Alternatives and Response Anchoring
      Order of Perceived Variables of Commonly Used Words and Phrases
      Considerations Related to the Physical Characteristics of
        Questionnaires
      Considerations Related to the Administration of Questionnaires
      Characteristics of Respondents that Influence Questionnaire Results
      Considerations Related to the Evaluation of Questionnaire Results
      General Recommendations

BIBLIOGRAPHY


LIST OF TABLES


Table V-1:     Summary of Studies on Mode of Items
Table V-2:     Summary of Research on Positive versus Negative Wording
               of Items
Table V-3:     Summary of Research on Objective versus Subjective Wording
               of Items
Table V-4:     Summary of Literature on Item Difficulty
Table V-5:     Summary of Studies Relating to the Order of Question Stems
Table V-6:     Summary of Studies on Order of Response Alternatives

Table VI-1:    Summary of Studies Relating to Number of Response
               Alternatives

Table VII-1:   Scale Values of Standard Set of Words
Table VII-2:   Scale Values of Selected Words
Table VII-3:   Words Marked "Unable to Rate" by 20 or More Subjects
Table VII-4:   Words Exhibiting Marked Bimodality of Response
Table VII-5:   Scale Values as Affected by Adverbial Modifiers
Table VII-6:   Scale Values and Standard Deviations of Stimulus Items
Table VII-7:   Means and Standard Deviations of Commonly Used Statements
Table VII-8:   Obtained Successive Intervals Scale Values of
               Adverb-Adjective Combinations
Table VII-9:   Adverb and Adjective Value Matrices
Table VII-10:  Numerical Ratings of Adverb-Verb Combinations
Table VII-11:  Scale Positions for Thirty-four Phrases
Table VII-12:  Scale Positions of 47 Intensity Phrases
Table VII-13:  Stability of Intensity Phrases in Diverse Contexts
Table VII-14:  Ratings of Likableness, and Likableness Variances for
               Personality Traits
Table VII-15:  Means and Standard Deviations for Phrases of Degrees of
               Adequacy
Table VII-16:  Means and Standard Deviations for Phrases of Degrees of
               Acceptability
Table VII-17:  Means and Standard Deviations for Phrases Used for
               Comparison
Table VII-18:  Scale Scores of Statements Based on Over-all Acceptability
Table VII-19:  Meaning of Frequency of Words
Table VII-20:  Correlations of Jones & Thurstone and Myers & Warner
               "Scale" Values
Table VII-21:  Correlations of Myers-Warner and Cliff Scale Values
Table VII-22:  Summary of Perceived Favorableness of Commonly Used Words
               and Phrases


INTRODUCTION 


ORA-Operations  Research  Associates  has  surveyed  and  reviewed  the 
literature  on  the  design  and  construction  of  questionnaires  as  part  of 
a contract  with  the  Army  Research  Institute  for  the  Behavioral  and 
Social  Sciences,  Fort  Hood,  Texas.  This  report  presents  the  results  of 
that survey and review. It is based on a broad definition of
questionnaire to include scales, structured interview forms, survey forms, and
similar  paper  and  pencil  instruments  used  to  elicit  responses  and  collect 
information. 

The emphasis of this review was on questionnaires used with Army
personnel participating in military field tests concerned with evaluating
training, equipment, organizations, concepts, and doctrine, but little was
found on this topic. However, since considerations affecting questionnaire
construction for Army field test evaluations are common to questionnaire
construction for other uses, this review covers the pertinent literature
from other fields. The review was not concerned with the evaluation of
soldier attitudes or reactions pertaining to societal problems,
personality, academic testing, or similar research areas except as
the [illegible] methodological considerations were also applicable to field
[illegible]. Emphasis was placed on those sources which provided empirical
[illegible] questionnaire construction. Material on the administration and
[illegible] of questionnaires and on questionnaire application and results was
excluded except where specifically related to questionnaire construction.
Topics not stressed in the literature review are noted as appropriate in
the text.

The literature search was quite comprehensive and included the review
of journals, books, and reports in the fields of psychology, education,
sociology, marketing, and the military. Both hand and computer searches
were made. Computer searches were made of information retrieval systems
maintained by: the American Psychological Association for Psychological
Abstracts covering the years 1967 to 1974; the Educational Resources
Information Center for the years 1957 to 1974; the National Technical
Information Service for 1963 to 1974; the Defense Documentation Center;
and the Bureau of the Census.

Hand searches were made to supplement the computer searches and
included the:

Psychological Abstracts for 1949 through 1967; Annual Reviews of
Psychology for 1960 through 1974; Journal of Marketing for 1942 to
1974; Journal of Advertising Research for 1960 to 1974; Journal of
Marketing Research for 1964 to 1974; Business Periodicals Index for 1951
to 1974; and Public Administration Information Service for 1949 to 1974.


Hand searches were also made of several bibliographies: Goheen and
Kavruck (1950) covered the early work for the years 1929 to 1949; Potter,
Sharpe, Hendee, and Clarke (1972) covered more recent work; the ARI Field
Unit at MASSTER, Fort Hood, provided a March 1974 short bibliography
on the subject; and the in-process bibliography of the Army's Test and
Evaluation Command was also reviewed. Finally, the articles abstracted
were reviewed for references, as were recognized pertinent texts and staff
personal files.

The  literature  search  yielded  a total  of  over  2,000  citations  on 
questionnaire  construction  and  methodology;  however,  abstracts  were  only 
available  or  prepared  for  about  half  of  the  citations.  This  limitation 
was  imposed  by  the  level  of  effort  available,  and  the  selections  were  made 
on  the  basis  of  the  apparent  relevance  of  each  citation,  judging  primarily 
from  its  title  or  abstract  if  available.  The  actual  writing  of  the 
following  chapters  was  based  on  a selection  from  these  abstracts,  with 
occasional reference to the actual articles, depending on the
organizational needs of the chapter as seen by its author. The articles actually
cited  in  the  writing  are  included  in  the  attached  bibliography  and  are 
identified  by  asterisks. 

The  results  of  this  literature  search  were  used  as  a basis  for  the 
development  of  a manual  on  questionnaire  construction  (Dyer,  Matthews, 
Wright, & Yudowitch, 1975). The manual was prepared for use as a guide
by  personnel  charged  with  the  development  of  questionnaires  for  use  in 
Army  field  test  evaluations.  It  includes  chapters  on  topics  discussed 
in  this  report. 


In the text which follows, Chapters II through XI were selected and
organized to cover comprehensively and with minimal overlap the technical
objectives of the study contract between ORA and ARI. These chapters
also include for completeness some additional parallel items. Chapter II
discusses the advantages and disadvantages of various types of
questionnaires. Chapter III considers the selection of questionnaire items
to be used, including the content of questionnaire items and the pros and
cons of using various types of questionnaire items. Chapter IV notes
articles about various scaling techniques. The effects of variations in the
presentation of questionnaire items are covered in Chapter V, while Chapter
VI reviews articles on the number of response alternatives and response
anchoring. The order of perceived favorableness of commonly used words
and phrases is the topic of Chapter VII. Chapter VIII examines
considerations related to the physical characteristics of questionnaires,
while considerations related to the administration of questionnaires are
covered in Chapter IX. Characteristics of respondents that influence
questionnaire results, including various biases and response sets, are
discussed in Chapter X, while Chapter XI is devoted to considerations
related to the evaluation of questionnaire results. Finally, Chapter XII
notes recommended areas for further research based upon either identified
gaps in the empirical research or contradictions among studies.




Chapter  II 


ADVANTAGES  AND  DISADVANTAGES  OF  VARIOUS  TYPES  OF  QUESTIONNAIRES 


This  chapter  discusses,  to  the  extent  articles  were  available  on  the 
topic,  some  of  the  advantages  and  disadvantages  of  using  various  types  of 
questionnaires,  as  the  word  "questionnaire"  was  defined  in  Chapter  I.  In 
the  first  section  below,  methods  to  measure  attributes  and  behavior  are 
mentioned.  Next,  the  structured  interview  is  first  compared  with  mail 
questionnaires, and then with other types of questionnaires. Comparisons
between  open-  and  closed-ended  items  are  then  discussed. 

Methods  to  Measure  Attributes  and  Behavior 

There  are  a number  of  techniques  of  data  collection  that  can  be 
used  to  measure  human  attributes  and  behavior,  some  of  which  have  been 
reviewed  by  Deri,  Dinnerstein,  Harding,  and  Pepitone  (1948).  The  methods 
include  observation,  personal  and  public  records,  specific  performances, 
sociometry,  interviews,  questionnaires,  rating  scales,  pictorial  techniques, 
projective  techniques,  achievement  testing,  and  psychological  testing,  among 
others.  For  this  review,  however,  attention  has  been  restricted  to  a more 
limited  number  of  data  collection  techniques:  certain  paper  and  pencil  types 
of  instruments  broadly  classed  as  questionnaires  as  defined  in  Chapter  I, 
and  including  only  some  of  the  techniques  mentioned  above.  A distinction 
has also been made, in the text to follow, between open-ended questionnaire
items and closed-ended items. Open-ended items are those which permit the
respondent to express his opinions in his own words and to indicate any
qualifications he wishes. Closed-ended items, on the other hand, utilize
response alternatives, such as multiple choice or true-false. Structured
interviews are included within the definition of questionnaire used since
typically an interview schedule is developed and employed by an interviewer
both for asking questions and recording responses, much like a
self-administered questionnaire with open-ended items. This distinction is
not as clear as it might be, however, since some investigators (such as
Paradise and Blankenship, 1951) admit of orally administered questionnaires,
structured interviews, and unstructured interviews. In any case,
unstructured interviews are outside the scope of this review, and they will
not be discussed further.


Comparison  of  the  Structured  Interview  and  Mail  Questionnaires 

During  the  literature  review,  attention  was  given  to  articles  on  the 
use  of  mail  questionnaires  only  to  the  extent  that  the  information  might 
be generalizable to other types of questionnaires. Accordingly, any
articles  related  to  sampling  considerations,  correcting  variance  estimates 
for  non-response,  etc.,  were  ignored.  Since  the  use  of  mail  questionnaires 
involves  the  consideration  of  issues  that  do  not  pertain  to  the  use  of 
other  types  of  questionnaires,  they  are  discussed  separately. 




A number of criteria were employed by O'Dell (1962), who compared
personal interviews and mail questionnaires using identical terms. He
found an interview bias, in that during the interviews the usage of
certain types of products was understated when it might reflect unfavorably
on the respondent. Wiseman (1972), who compared a mailed questionnaire,
telephone interview, and personal interview, concluded that issues involving
socially accepted or rejected answers will effect more bias in interviews
than in questionnaires. Ellis (1948) similarly found more self-revelatory
or unfavorable responses in anonymous mailed questionnaires than in
interviews. Ford (1969) asked identical questions in a mail questionnaire
followed by an interview. He found that there was a consistency of response
about newspaper readership and about socioeconomic factors, but inconsistency
on items related to attitudes and opinions, the location of past purchases,
and when past purchases were made. A number of factors may have influenced
his results, however, such as the time lapse between the questionnaire
completion and the interview. Williams (1968) noted that data gathered
by telephone interview may be less accurate than those obtained from a
mail questionnaire, since the group who are at home to answer the
telephone may not be as representative as those to whom the questionnaires
are mailed.

The comparative costs of interviews and mailed questionnaires were
discussed in five articles. Cahalan (1951) administered a 23-page mailed
questionnaire to 1,051 Army officers, and found it was less expensive,
more anonymous, and faster than the interview technique. O'Dell (1962)
reported that the costs of interviewer time tended to outweigh the costs
of obtaining and maintaining a mail panel. Gibson and Hawkins (1968)
concluded that, under the promise of anonymity, the questionnaire should
equal the interview in response information at a much smaller expense
(although there is some question about the survey design they employed).
The degree of consistency between interview and questionnaire results found
by Parker, Wright, and Clark (1957) also raised questions concerning the
justification of the expense of interviewing, when questionnaires or
similar techniques would be only slightly less reliable. Sudman, Greeley,
and Pinto (1965) were somewhat more conservative in their conclusion,
reporting that costs were not significantly affected, regardless of whether
interviews, mail questionnaires, or a combination of both were employed.

Specificity  of  responses  was  discussed  only  by  O'Dell  (1962).  He 
noted  that  noncommittal  responses  and  the  tendency  not  to  answer  open- 
ended  questions  were  more  prevalent  for  mail  questionnaires  than  for 
personal  interviews,  as  might  be  expected. 

Combinations of survey methods were discussed in three articles.
Sudman, Greeley, and Pinto (1965) found that self-administered
questionnaires used in conjunction with personal interviews elicited a
slightly higher cooperation/return rate from respondents than either used
alone. The result that comparisons between interviews, self-administered
questionnaires, and a combination of both did not indicate any large
differences suggested to them that additional flexibility should be
considered in methods of survey research. Payne (1964) also suggested that
sometimes a combination of survey methods, such as personal interview,
telephone interview, and mail questionnaires, may be used with the same
respondents to produce results more efficiently than one method alone could
do. However, he presented no firm evidence of higher reliability or validity
for combined methods over individual survey methods. Sharp (1955) found
that when respondents were unable to give complete information during an
interview and copies of a questionnaire were left to be mailed back,
40% were returned, thus eliminating the necessity for call-backs.


Comparison  of  the  Structured  Interview  and  Other  Questionnaires 

Most of the studies comparing the structured interview with
questionnaires other than mail questionnaires did so in terms of the consistency
of  response  from  one  technique  to  the  other.  For  example,  Bennett,  Alpert, 
and  Goldstein  (1954),  though  working  with  only  16  subjects,  found  that 
26  out  of  30  questions  showed  significant  consistency  of  response  from 
a one  hour  interview  immediately  followed  by  the  use  of  a limited  response 
questionnaire  on  the  same  topic.  Consistency  coefficients  reported  were 
1.00  (perfect)  for  sociological  information,  .78  on  knowledge,  .69  on 
past  behavior,  and  .46  on  attitudes.  The  conclusion  reached  was  that  on 
information  other  than  sociological,  differences  in  response  will  be  noted 
between  interview  and  limited  choice  questionnaires,  especially  concerning 
attitudes.

The  results  obtained  by  Bennett,  Alpert,  and  Goldstein  (1954)  appear 
to  have  been  supported  in  part  by  two  other  investigations.  Walsh  (1967) 
compared  the  accuracy  of  the  interview,  questionnaire,  and  personal  data 
blank  for  collecting  verifiable  biographic  information.  Comparing  collected 
data  to  available  records  for  270  students,  he  found  no  differences,  and 
concluded that biographic data may be collected reliably by the most
efficient means. Boulger (1970) also found that the validity of response to
interviews  and  questionnaires  was  not  significantly  different  in  the 
elicitation  of  life  history  data. 

Three  studies  compared  structured  interviews  and  other  questionnaires 
in  the  measurement  of  attitudes.  In  the  first,  Metzner  and  Mann  (1952) 
followed  a fixed  alternative  questionnaire  administered  to  328  employees 
with  an  open-ended  interview.  They  noted  a tendency  for  the  employees  to 
rate  slightly  higher  in  the  interview  than  on  the  questionnaire.  There 
were, however, a number of limitations to the study, including a two-month
time lapse between completion of the questionnaire and the interview. In
the second study, Wedell and Smith (1951) found that interviewers
overestimated attitude in comparison with self-judged attitude, although the
objective rating of interview record sheets was closer to self-rating than
the interviewers' rating. Wheatley (1973), however, found no significant
differences between mean scale scores for two groups, one of which expressed
their attitudes during a telephone interview, while the other group responded
on a self-administered questionnaire.


Although studies involving the use of questionnaires for the
measurement of personality were generally excluded from the literature review,
three were considered in that they compared results obtained from interviews
and questionnaires. Eysenck and Eysenck (1962) sought to answer the question
of whether an interview-questionnaire would reveal a factorial structure
essentially identical to that found with questionnaires administered in the
orthodox manner. The results indicated that the method of administration
did not affect the factorial composition of the items, which measured
extraversion and neuroticism. Ambler, Blair, deRivera, Nelson, and
Schoenberger (1958) also found that the interview and questionnaire methods
gave similar results in the classification of subjects according to three
levels of anxiety towards flying. The conclusions reached by Levonian (1963),
however, were different. He determined the reliability of three short
personality scales administered by the interview survey method to 432
subjects. The values were sufficiently less than the consistency
reliabilities of short scale personality measures obtained by the usual
questionnaire survey method to raise serious questions about the adequacy
of such personality measures obtained by the interview method.

A comparison  of  interview  and  other  questionnaire  results  when  ego- 
involving questions  were  asked  was  the  topic  of  two  reports.  Knudsen, 

Pope,  and  Irish  (1967)  concluded  that  interviews  may  lessen  the  expression 
of  deviance,  compared  with  anonymous  questionnaires.  Based  on  three 
different  samples  of  white  women  all  of  whom  were  or  had  been  premaritally 
pregnant  for  the  first  time,  the  data  suggested  that  in  interview  situations 
respondents  were  more  likely  to  support  the  public  and  restrictive  sexual 
norms  that  they  assumed  were  adhered  to  by  the  interviewer.  In  the  private 
and  anonymous  questionnaire  situation,  the  respondents  more  often  answered 
to  subcultural  norms.  Ellis  (1947b)  compared  the  questionnaire  and  interview 
methods  in  the  study  of  human  love  relationships.  His  results  indicated 
that the great majority of subjects gave less favorable, or more
incriminating, responses to the questionnaires than they did to the interview.
Ellis concluded that for more ego-involving questions the questionnaire
may produce more self-revelatory data than the interview.


Comparison of Open- and Closed-Ended Items

Of  the  five  articles  that  compared  the  use  of  open-ended  and  closed- 
ended  questionnaire  items,  three  appeared  to  favor  the  use  of  the  open- 
ended  format,  at  least  for  the  factors  considered.  Ellenbogen  and  Danley 
(1962),  in  a study  of  the  comparability  of  responses  to  a socially 
concordant  question,  found  that  responses  were  more  varied  to  the  open- 
ended  question  than  to  the  closed,  although  the  closed  had  an  "other" 
category.  Asking  about  resources  of  helpful  health  advice,  they  also 
found  that  19%  of  the  responses  were  inconsistent,  in  that  sources  of  advice 
cited  in  the  open  question  were  omitted  in  the  closed. 

England  (1948)  compared  open-ended  and  dichotomous  items  about 
capital punishment in three survey samples of 2,000, 3,000, and 6,000.
The results gave preference to the open-ended items, since they allowed for
the expression of middle party opinions that the dichotomous items forbade.
However,  in  coding  the  open-ended  items,  expert  analysts  were  required  to 
obtain  reliable  results. 

The  results  of  a computer-assisted  method  of  free  response  (after 
which  the  respondents  evaluated  the  responses  they  generated  on  a rating 
scale) were compared with responses to prelisted statements in a study by
Kohan,  deMille,  and  Myers  (1972).  Although  no  significance  tests  were 
reported,  the  free  response  method  appeared  to  generate  response  categories 
that  differed  rather  substantially  from  the  prelisted  statements.  Issues 
of  importance  that  were  overlooked  by  the  questionnaire  developers  were 
identified.  It  was  concluded  that  reliance  on  the  conventional  method 
may  distort  a study's  focus  by  obtaining  data  on  items  not  of  real  concern 
and  having  no  accurate  means  to  measure  concern.  The  authors  also 
noted  that  high  affirmative  levels  for  an  item  can  often  be  interpreted 
as  a response  set  or  lip  service,  while  responses  generated  by  unstructured 
methods  are  probably  more  reflective  of  personal  involvement  or  concern. 

The study favoring the use of closed-ended items was by Scates and
Yeomans (1950a). It was undertaken by the American Council on Education
to determine the value of objective tests for identifying those scientists
and engineers who were likely to undertake further education. It was
concluded that the use of objective tests was more advantageous than the
several  depth  essay  questions  used  in  a previous  study,  because  they  took 
less  time  and  were  therefore  more  acceptable  to  the  employees. 

The  best  summary  for  this  section  was  stated  by  Prien,  Otis,  Campbell  & 
Saleh  (1964).  They  noted  that  the  open-ended  type  of  questionnaire  has 
the  advantage  of  providing  unique  information,  whereas  the  objective  type 
of  questionnaire  is  generally  more  reliable.  The  combination  of  both,  they 
said,  would  appear  to  be  best. 


Conclusions


The  decision  about  which  type  of  questionnaire  to  use  depends  upon 
the  specific  research  question  that  one  is  attempting  to  answer  and  the 
practical  limitations  involved.  Both  structured  interviews  and  other  types 
of  questionnaires  appear  to  have  their  place  in  research  studies,  and  both 
have their limitations. The choice of which to use may well depend
upon  costs,  which  are  generally  lower  for  the  typical  questionnaire.  The 
typical  questionnaire  is  apparently  more  reliable,  while  the  structured 
interview  may  provide  more  unique  information.  If  the  dimensions  of  a 
problem  have  not  been  explored  before,  the  best  compromise  would  appear  to 
be to use the interview approach with open-ended items to uncover the
dimensions, and follow this by the use of the more reliable paper and pencil
questionnaire  to  obtain  more  specific  information. 


Chapter  III 


SELECTION  OF  QUESTIONNAIRE  ITEMS  TO  BE  USED 

Once  a decision  has  been  made  as  to  the  type  of  questionnaire 
instrument  to  use  (the  topic  of  Chapter  II),  the  specific  questionnaire 
items  to  be  administered  need  to  be  selected.  The  two  main  sections  in 
Chapter  III,  then,  address  the  content  of  questionnaire  items  and  the  pros 
and  cons  of  various  types  of  questionnaire  items. 


Content  of  Questionnaire  Items 

This  section  considers  first  methods  for  determining  questionnaire 
content,  and  then  other  issues  related  to  questionnaire  content. 

Methods  for  Determining  Questionnaire  Content 


There  are  a number  of  ways  that  can  be  used  to  determine  questionnaire 
content.  One  of  these  that  is  not  too  well  known  is  the  critical  incident 
technique.  As  noted  by  Flanagan  (1954)  the  critical  incident  technique 
consists  of  a set  of  procedures  for  collecting  direct  observations  of 
human  behavior  in  such  a way  as  to  facilitate  their  potential  usefulness 
in  both  solving  practical  problems  and  in  developing  broad  psychological 
principles.  The  technique  outlines  procedures  for  collecting  observed 
incidents  of  behavior  having  special  significance  and  meeting  systematically 
defined criteria. It can be of assistance, therefore, in helping to
determine the content of items to be included in questionnaires. Although many
articles  on  the  technique  have  been  published,  they  were  not  all  reviewed  in 
conjunction  with  preparing  this  review.  One  article  on  the  topic  was 
prepared by Barnes (1960), who gave an historical sketch of the
development of the technique, and outlined the procedures to follow in using this
approach for social research. The procedures, representing one way that
the critical incident technique can be used, included: determining the
aims of the investigation; securing competent reporters or observers;
collecting  the  critical  incidents  of  behavior  actually  observed;  selecting 
those  incidents  to  be  included  in  the  final  study;  analyzing  and  classifying 
the  data;  and  interpreting  the  findings. 

Another  method  for  selecting  items  for  an  attitude  scale  was  used  by 
Alilunas  (1949),  who  was  concerned  not  only  with  finding  out  what  people 
think  about  an  issue,  but  how  they  think  about  matters  on  which  they  are 
asked  to  give  an  opinion.  The  method  starts  with  asking  a group  of 
individuals to write six statements giving their impressions of a topic,
such  as  capitalism.  From  these,  some  smaller  number  of  statements  are 
selected  that  are  readable,  intelligible,  and  capable  of  classification. 
These  statements  can  then  be  sorted  into  several  categories,  such  as  the 
status  of  the  topic  and  its  good  and  bad  features. 


Yet  another  way  of  developing  closed-ended  questionnaire  items  is  to 
evaluate the responses to corresponding open-ended items, as suggested by
authors such as Payne (1965). Reporting on a computer-assisted method of
free response analysis where respondents give answers they think appropriate
and then rate their answers according to dimensions specified on a scale,
Kohan, deMille, and Myers (1972) stated that the method identified issues
of importance that had been overlooked by questionnaire developers either
because of their own biases or imperfect knowledge. They also noted that
reliance  on  the  conventional  method  may  distort  a study's  focus  by  obtaining 
data  on  items  not  of  real  concern  and  having  no  accurate  means  to  measure 
concern.  Also  on  the  topic  of  bias,  Schuessler  (1952)  questioned  the 
randomness  of  item  selection  in  scale  analysis.  He  showed  data  to  indicate 
that differences among investigators' definitions of the universe and bias
in selecting items both affect their results. He concluded that much
more  critical  attention  is  needed  by  the  researchers  to  avoid  their  own 
biases  and  influence  in  gathering  data  for  analyses. 

Hart,  Faust,  Rowland,  and  Lucier  (1964),  in  a report  on  attitudes  of 
troops  in  the  tropics,  noted  that  the  sentence  completion  technique  is 
useful  for  assessing  topic  and  dimension  saliency,  and  for  validating  the 
objective  techniques.  They  also  reported  that  a listing  technique  is 
valuable  for  identifying  salient  topic  dimensions  and  salient  topics,  and  for 
updating  instruments  which  are  developed  on  pilot  samples  and  used  on 
larger  populations.  They  feel  that  considerable  effort  should  be  exerted 
to identify the salient topical dimensions, their levels, and their
interrelationships whenever an objective scaling technique is used.


Other  Considerations  Related  to  Questionnaire  Content 

This section discusses a number of diverse topics, all of which are
related  in  some  way  to  questionnaire  content. 

Five obstacles to the selection of appropriate questions to test
social-psychological variables were discussed by Bradburn (1970). They are:

1.  Lack  of  agreement  among  behavioral  scientists  about  the  appropriate 
social-psychological  dependent  variables  that  are  relevant  to  particular 
social  programs; 

2.  An  inadequate  conceptualization  of  those  social-psychological 
variables  that  are  suggested  for  study; 

3.  A relative  lack  of  interest  in  systematic  methodological  research 
and  survey  measurement; 

4. The relative underdevelopment of measurement theory in survey work; and

5.  The  special  historical  and  cultural  problems  that  affect  the 
phraseology  of  questions. 


Among  the  principles  reported  by  Blankenship  (1942)  that  he  believes 
should  be  followed  in  the  wording  of  preference-type  questions,  those 
relating to questionnaire content are: to be psychologically sound, a
question should ask about past, present, or future behavior, rather than
hypothetical opinion; the questions should not damage the pride of
respondents; and the first few questions used must secure rapport with the
respondent.

The  fact  that  questionnaire  items  can  produce  variable  distortion  was 
pointed  out  in  the  report  of  a study  by  Klein,  Maher,  and  Dunnington  (1967). 
Items  dealing  with  salary  and  with  ratings  of  top  management  produced 
consistent  positive  distortions,  whereas  items  dealing  with  work  pressures 
and  the  respondent's  manager  produced  little  or  no  distortion  even  under 
conditions  of  high  threat.  Dunnette  and  Heneman  (1956)  also  noted  that 
a threat  to  anonymity  results  in  differential  amounts  of  response  distortion, 
depending  upon  the  content  of  different  items  comprising  the  questionnaire. 

Marquis,  Marshall,  and  Oskamp  (undated)  reported  on  a study  of  the 
accuracy and completeness of testimony as a function of kind of questions.
They found that for items of low salience, structured questioning resulted
in more complete but less accurate responses. However, for items of high
salience,  more  structured  questioning  did  not  reduce  accuracy.  Similarly, 
Miklich  (1966)  found  that,  if  an  ambiguous  item  was  important,  the  tendency 
was  to  agree  with  it.  If  it  was  unimportant,  the  tendency  was  to  disagree. 

Two  studies  considered  the  reliability  of  various  types  of  questionnaire 
items  as  a function  of  content.  Cavan  (1933)  concluded  that  questions  involving 
attitudes  or  estimates  have  lower  reliability  than  factual  questions,  and  that 
reliability  is  increased  by  avoiding  fine  detail.  Guber  and  Gerberich  (1946), 
on  the  other  hand,  found  that  factual  questions  showed  the  least  reliability. 

Finally,  Spector  (1957)  demonstrated  that  the  test  user's  values  and 
needs  do,  and  should,  enter  into  judgments  made  during  the  construction  and 
validation  of  an  attitude  test. 


Pros  and  Cons  of  Various  Types  of  Questionnaire  Items 

This section presents the pros and cons of various types of
questionnaire items as obtained from the literature reviewed. Included are: ranking
items;  rating  scale  items;  multiple  choice  items;  forced  choice  and  paired 
comparison  items;  card  sorts;  semantic  differential  items;  and  other  types 
of  items.  As  appropriate,  comparisons  of  item  types  are  included,  except 
for  a comparison  between  open-ended  and  closed-ended  items  generally, 
which was discussed in Chapter II.


Ranking  Items 


Comparison  of  ranking  and  rating  scales.  Five  articles  were  abstracted 
that compared ranking and rating methods. Bittner & Rundquist (1950) described the
rank comparison rating method, and noted that comparisons with other studies
revealed  that  the  method  gives  results  closely  related  to  rank  comparison. 
Murphy,  Bailey,  and  Covell  (1954)  found,  in  judging  frozen  strawberries, 
that  rating  provided  better  discrimination  than  ranking  when  ten  judges 
were used. However, Rennick, Grupe, Reich, and Sewell (1954) found
rankings to be more reliable than ratings when professional staff both
ranked and rated parents' descriptive reports of their children's growth
in specific character attitudes. Bartlett, Heermann, and Rettig (1960)
found that, for a single judge, the ranking and paired comparison
techniques were superior in reliability to the Likert, graphic rating, and
equal appearing intervals techniques. Kassarjian and Nakanishi (1967),
however, found comparability between ranking and Likert-type scaling
based on reliabilities and inter-method correlations when methods were
compared for the selection of a brand name for a fictitious new phonograph.

Comparison  of  ranking  and  paired  comparisons.  There  appears  to  have 
been  contradictory  evidence  obtained  when  the  ranking  and  paired  comparisons 
methods were compared. Wilkins (1950), using 300 men randomly selected
from British army reception centers, found that the two methods did not
yield similar results when the importance of eight characteristics of
jobs was considered. The observed differences did not appear to be
systematic or biased, although characteristics of least importance varied the
most between methods. Witryol & Thompson (1953) found that a paired comparison
questionnaire  was  a more  stable  measure  than  a partial  rank  order  form  of 
a social  acceptance  questionnaire  administered  to  about  80  sixth  grade 
students.  They  noted  that  this  may  be  due  to  the  larger  number  of  responses 
required  in  the  paired  comparison  form.  They  also  said  that  this  form  is  a 
more  sensitive  measure  of  the  status  of  individuals  in  the  middle  range 
of  the  acceptability  continuum,  and  offers  relatively  more  general 
measures  of  social  status.  The  partial  rank  order  scales  may  reflect 
more  personal  and  situational  factors.  Also  in  favor  of  paired  comparisons, 
Cohen  (1967)  suggested  that  the  ranking  of  stimuli  produces  a statistical 
artifact  that  can  be  corrected  and  controlled  by  paired  comparison 
analysis. The artifact is the inability of ranking to detect the
comparative position of each stimulus in relation to each other stimulus. Fenner,
Homant, and Rokeach (1968) compared the rank order and paired comparison
methods  of  measuring  terminal  and  instrumental  values.  For  the  terminal 
values,  the  paired  comparison  reliability  was  significantly  higher,  while 
for instrumental values the difference was not significant, the trend being
in  the  opposite  direction.  The  authors  concluded,  however,  that  the  benefit 
of  the  paired  comparison  method  as  compared  with  the  rank  order  method  is 
doubtful.  The  results  suggested  that  the  paired  comparison  method  should 
be  employed  when  measuring  value  systems  only  if  there  is  a principal  concern 
with  the  terminal  values  and  if  the  time  and  effort  expended  in  testing, 
scoring,  coding,  etc.,  are  not  important  considerations.  Bernard  (1933) 
had  come  to  a similar  conclusion,  noting  that  the  method  of  ranking  is  not 
inferior  in  reliability  to  that  of  paired  comparisons.  He  also  stated  that, 
since  it  took  twice  as  long  for  the  judges  to  complete  the  paired  comparisons 
as  the  ranking,  the  latter  was  the  superior  method. 
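
The difference in respondent burden behind these time comparisons follows
from simple counting: ranking n items calls for n placements, whereas a
complete paired comparison form calls for n(n-1)/2 judgments. The short
Python sketch below illustrates the arithmetic only. The function name is
hypothetical, and the sketch assumes that every possible pair is presented
exactly once.

```python
from math import comb


def paired_comparison_count(n_items):
    """Number of judgments when every pair of items is presented once."""
    return comb(n_items, 2)


# Eight items, as in the job-characteristics comparison described above.
pairs_for_eight = paired_comparison_count(8)
```

With eight items, a ranking requires eight placements while a full paired
comparison form requires 28 judgments, and the gap widens quickly as items
are added.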


Three other studies found essentially no differences
between the methods of ranking and paired comparisons. Eng and
French  (1948),  comparing  sociometric  and  psychological  methods  of  scaling, 
found  a near  perfect  correlation  between  mean  ranks  and  paired  comparisons. 
Kassarjian  and  Nakanishi  (1967),  in  the  study  noted  in  the  previous  section, 
also  found  comparability  between  ranking,  paired  comparisons,  and  open 
choice  preferences.  Slater  (1965)  also  found,  from  four  experiments, 
comparability  of  ranking,  paired  comparisons,  and  other  forced  choice 
comparisons  for  recording  personal  preferences.  He  concluded  that  the 
whole  weight  of  the  evidence  is  in  favor  of  the  view  that  an  informant, 
when expressing his personal preferences, tends to maintain a level of
reliability  which  characterizes  him  as  an  individual,  and  is  unaffected 
either  by  variations  in  the  number  of  objects  he  is  given  to  compare  or 
changes  in  the  methods  he  is  asked  to  use. 

The  relationship  between  ranking  and  the  method  of  paired  comparisons 
was  reported  by  Ross  (1955)  and  Pauli  (1968).  Ross  showed  that,  when  N 
judges  are  asked  to  indicate  their  preferences  for  n items  by  both  the 
method  of  paired  comparisons  and  the  method  of  rank  order,  a linear 
relationship  holds  between  the  total  number  of  choices  from  the  paired 
comparison  method  and  the  mean  rank  from  the  rank  order  method.  Pauli 
(1968)  studied  the  reliability  of  results  obtained  by  the  psychophysical 
methods  of  rank  ordering  and  paired  comparisons  when  subjects  are  ego 
involved  in  the  material  being  judged.  He  found  that  scales  derived  by 
the  two  methods  are  linear  in  relationship. 
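
The linear relationship reported by Ross can be illustrated with a small
worked example. The Python sketch below is not Ross's derivation. It assumes
that each judge's paired comparison choices agree perfectly with that judge's
rank order (rank 1 being the most preferred), in which case an item ranked r
among n items is chosen n - r times, and the total number of choices over N
judges equals N(n - mean rank). The data and function names are invented for
illustration.

```python
from itertools import combinations


def paired_choice_totals(rankings):
    """Count, per item, how often it is chosen over all judges and pairs.

    Each judge is assumed to choose the better-ranked item in every pair.
    """
    n_items = len(rankings[0])
    totals = [0] * n_items
    for ranks in rankings:
        for a, b in combinations(range(n_items), 2):
            winner = a if ranks[a] < ranks[b] else b
            totals[winner] += 1
    return totals


def mean_ranks(rankings):
    """Mean rank of each item over all judges."""
    n_judges = len(rankings)
    n_items = len(rankings[0])
    return [sum(r[i] for r in rankings) / n_judges for i in range(n_items)]


# Hypothetical data: three judges (N = 3) each rank four items (n = 4).
rankings = [
    [1, 2, 3, 4],
    [2, 1, 4, 3],
    [1, 3, 2, 4],
]
N, n = len(rankings), len(rankings[0])
totals = paired_choice_totals(rankings)
means = mean_ranks(rankings)

# Linear relationship under the consistency assumption: total = N * (n - mean rank).
predicted = [N * (n - m) for m in means]
```

When judges are inconsistent between the two methods, the totals would be
expected to scatter around this line rather than fall exactly on it.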


Rating  Scale  Items 

Comparison  of  rating  scale  and  multiple  choice  items.  Only  one  study 
comparing rating scale and multiple choice items is reported here, since a
majority  of  such  studies  involve  issues  regarding  the  number  of  response 
alternatives  to  employ  and  are  hence  discussed  below  in  the  first  section 
of  Chapter  VI.  Greenwald  and  O'Connell  (1970)  conducted  a study  to  test 
previous  findings  that  suggested  that  dichotomous  measures  yielded  similar 
but  not  equivalent  information  to  that  of  Likert  scales.  The  results  showed 
that,  as  in  previous  studies,  the  true-false  and  Likert  methods  correlated 
significantly.  However,  the  Likert  format  produced  the  higher  item-total 
correlations.  The  greater  internal  consistency  for  the  Likert  approach 
suggested  a possible  advantage  for  Likert  scaling. 

Comparison  of  rating  scale  items  and  forced  choice  or  paired  comparison 
items.  In  this  chapter,  forced  choice  and  paired  comparison  items  are 
discussed  together  since  the  latter  is  but  a special  case  of  the  former, 
using  duads. 


A study  by  Pilgrim  and  Wood  (1955)  compared  the  sensitivity  of  rating 
scale and paired comparison methods for measuring consumer food preferences
under  laboratory  conditions.  The  methods  were  found  to  be  equally  sensitive 
whether  the  differences  in  preference  were  large  or  small.  Similarly, 
Greenberg  (1963)  found  no  significant  differences  between  rating  scale  and 
paired  comparison  tests  used  in  consumer  product  tests. 

In  the  attitude  measurement  area,  Neidt  and  Merrill  (1951)  compared 
five point rating scales and paired (positive and negative) statements.
They found that each showed about equal validity coefficients. Although
the  reliability  of  the  rating  scale  was  somewhat  higher  than  that  of  the 
paired  comparison  form,  the  authors  felt  that  there  are  advantages  to 
the  latter  which  warrant  its  consideration  under  some  circumstances. 

Horst  and  Wright  (1959)  also  obtained  higher  reliability  for  a self-appraisal 
personality  rating  scale  than  for  a paired  comparison  inventory  composed  of 
the same items, although the rating scale scores were arithmetically
ipsatized. (See Chapter XI for a discussion on the properties and uses of
ipsative scores.) The rating scale also required only about one-third as
much time to administer as the paired comparison test.
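
Ipsatizing, mentioned above, can be sketched briefly. In one common form,
each respondent's scores are expressed as deviations from that respondent's
own mean, so that the ipsative scores sum to zero for every respondent.
Whether Horst and Wright used exactly this procedure is not stated here. The
Python sketch below, with invented scores and a hypothetical function name,
only illustrates the general idea.

```python
def ipsatize(scores):
    """Express one respondent's scale scores as deviations from their own mean."""
    mean = sum(scores) / len(scores)
    return [s - mean for s in scores]


# Hypothetical ratings by one respondent on four personality scales.
respondent = [4, 2, 5, 1]
ipsative = ipsatize(respondent)
```

Because every respondent's ipsative scores sum to zero, they show the
relative standing of the scales within a person rather than differences in
overall level between persons.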

A personality  questionnaire  and  forced  choice  personality  test  were 
compared  by  Gordon  (1951).  Both  had  the  same  factor  structure  and  much 
the  same  item  content,  and  were  constructed  by  the  method  of  internal 
consistency.  For  all  four  personality  scales  the  forced  choice  method  was 
found  to  be  more  valid  than  the  questionnaire  method,  using  descriptive 
nominations  by  associates  as  the  criterion.  Multiple  correlations  indicated 
that  the  questionnaire  data  added  nothing  towards  the  prediction  of  the 
criteria  when  placed  in  a battery  with  the  forced  choice  test. 

Scott  (1968)  did  a study  of  the  comparative  validities  of  self-report 
forced  choice  and  single  stimulus  tests.  He  noted  that  the  generalization 
that  forced  choice  personality  inventories  are  more  valid  than  single 
stimulus  forms  of  the  same  tests  was  not  supported  by  initial  examination 
of  the  relevant  evidence.  Apparently  only  one  study  that  claimed  superior 
validity  for  the  forced  choice  format  appeared  to  have  used  identical 
items  in  the  two  forms.  Other  studies  either  did  not  use  single  stimulus 
forms  for  comparison,  did  not  hold  item  content  constant  between  the  two 
forms,  or  else  yielded  nonconfirming  results.  He  reported  also  that  the 
most  tenable  conclusion  is  that  test  validity  does  not  depend  on  this 
characteristic  of  item  format  under  the  circumstances  in  which  these  self- 
report  inventories  are  typically  administered. 

Newhall  (1954)  compared  the  methods  of  paired  comparison  and  single 
stimuli  in  the  evaluation  of  a series  of  color  prints  and  color  transparencies. 
The  two  methods  produced  highly  correlated  results.  However,  the  method  of 
single  stimuli  was  preferred  as  being  the  more  efficient  method  in  making 
judgments  where  items  do  not  require  juxtaposition. 


All of the studies to be reviewed in the rest of this section involved
the use of judges or raters, and hence may not be comparable to studies
based upon self-report. Using three groups of judges, the paired comparison
and equal appearing intervals methods of scaling attitude statements toward
teaching were compared empirically by Crawford (1965). He said that the two methods
appeared  comparable,  at  least  when  expert  judges  were  used.  The  use  of 
students  as  judges  in  this  type  of  study  was  questioned.  Students  rated 
occupational  status  in  the  study  by  Bartlett,  Heermann,  and  Rettig  (1960), 
where  little  difference  in  scale  values  or  reliability  for  mean  scale 
values  was  found  using  the  paired  comparisons,  Likert,  graphic  rating, 
and  equal  appearing  intervals  techniques.  The  paired  comparison  and 
ranking  techniques  were,  however,  found  to  be  superior  in  reliability  for 
a single  judge.  Using  85  subjects  to  judge  the  esthetic  value  of  seven 
handwriting  specimens,  Ekman  and  Kunnapas  (1960)  constructed  an  interval 
scale  by  the  method  of  paired  comparisons  and  a ratio  scale  by  a variant 
of  the  method  of  ratio  estimation.  They  both  gave  essentially  the  same 
results. 

A graphic  rating  scale  was  compared  with  six  kinds  of  forced  choice 
forms for rating Air Force technical instructors, by Berkshire and Highland
(1953). Scores from the graphic rating scale exhibited relatively little
bias and had as high validity as the best of the forced choice scales.
Combining the scores from the graphic and forced choice scales yielded
validity  coefficients  substantially  higher  than  for  either  alone.  The  use 
of  forced  choice  items  and  both  eight  and  five-step  graphic  scales  was 
compared  in  a study  conducted  by  the  U.S.  Department  of  the  Army  (1952)  using 
400 officers as a rater-ratee population. The eight-step graphic scale had
the  highest  validity  (.53).  A study  by  Staugas  and  McQuitty  (1950), 
however,  found  the  forced  choice  method  superior  to  the  use  of  a graphic 
scale.  But  Bayroff,  Haggerty,  and  Rundquist  (1954)  found  that  two  types 
of graphic rating scales and two modifications of the forced choice
technique did not differ markedly in validity.

Susceptibility  to  errors  was  the  concern  of  two  other  investigations. 
Leftwich  and  Remmers  (1962)  compared  graphic  and  forced  choice  (tetrad) 
ratings  of  teacher  performance.  Distributions  and  intercorrelations  of 
mean  item  and  mean  total  scores  showed  the  graphic  form  relatively  more 
susceptible  to  errors  of  leniency  and  halo.  Item  intercorrelations  were 
also  higher  in  general  for  the  graphic  form.  The  authors  noted,  in  addition, 
that  the  forced  choice  form  was  susceptible  to  fakability,  relative  to  the 
transparency  of  any  forced  choice  tetrad.  Bartlett  and  Sharon  (1969) 
determined  the  effects  of  several  instructional  rating  conditions  on  leniency 
on  a graphic  and  forced  choice  rating  scale.  A significant  leniency  effect 
was  found  with  the  graphic  ratings  which  were  to  be  used  for  evaluation 
purposes  and  those  which  had  to  be  justified  to  the  ratee,  but  presumably 
not  with  those  that  were  anonymous  or  were  identified  by  having  the  rater 
place  his  name  on  the  form.  It  was  concluded  that  the  forced  choice  scale 
was  quite  resistant  to  leniency  bias. 


Comparison  of  rating  scale  items  and  card  sorts.  Two  studies  compared 
the  use  of  rating  scales  and  card  sorts,  both  in  terms  of  determining  scale 
values.  Seashore  and  Hevner  (1933)  substituted  a nine  point  scale  for  each 
item  for  the  standard  method  of  sorting  items  into  nine  piles  from  separately 
printed  slips.  The  rating  method  saved  87%  of  the  time  in  assembling 
materials and 50% of the time in tabulating results involved in making
attitude  scales  by  Thurstone's  method  of  equal  appearing  intervals.  The 
subjects  found  the  task  easier  and  more  pleasant,  and  the  results  showed 
negligible  differences  in  the  medians  or  scale  values  of  the  items,  and 
in  the  difference  or  spread  of  opinion  (Q  value)  in  regard  to  them. 

An  investigation  of  the  stability  of  median  and  Q values  computed  from 
graphically  derived  and  from  sorted  judgments  used  in  scaling  by  the  method 
of  equal  appearing  intervals  was  conducted  by  Siegel  and  Siegel  (1962). 

Graphic  judgments  using  a nine  point  scale  tended  to  yield  higher  Q values 
than  nine  pile  sorts  for  relatively  unambiguous  items.  The  medians  derived 
from  the  two  procedures  correlated  .97. 
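
The scale value and Q value referred to in these two studies can be sketched
as follows. In the method of equal appearing intervals, each item's scale
value is the median of the judges' placements and its Q value is the
interquartile range of those placements. The Python sketch below computes
both from raw judgments with the standard library. The judgment data are
invented, and the simple percentile interpolation used here is an assumption
that may differ from the interpolation used in the original studies.

```python
from statistics import median, quantiles


def scale_and_q(judgments):
    """Return (median scale value, Q value) for one item's judged placements."""
    # quantiles(..., n=4) returns the three quartile cut points.
    q1, _, q3 = quantiles(judgments, n=4, method="inclusive")
    return median(judgments), q3 - q1


# Hypothetical placements of one statement by nine judges on a nine point scale.
placements = [3, 4, 4, 5, 5, 5, 6, 6, 7]
scale_value, q_value = scale_and_q(placements)
```

A small Q value indicates that the judges agreed closely on where the
statement belongs; a large Q value marks the statement as ambiguous.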

Comparison  of  rating  scale  and  semantic  differential  items.  In  the 
study  by  Hart,  Faust,  Rowland,  and  Lucier  (1964)  on  the  attitudes  of  troops 
in the tropics, it was concluded that Osgood's semantic differential
technique was clearly superior to Likert's agree/disagree method of summated
ratings.  They  went  on  to  note  that,  for  most  purposes,  attitudinal  data 
collection  efforts  in  which  objective  questionnaires  are  used  should  consist 
of  some  form  of  the  semantic  differential  scaling  technique  as  opposed  to 
agree/disagree  versions  of  Likert's  method  of  summated  ratings. 

Hughes  (1967)  compared  the  use  of  Thurstone  and  modified  semantic 
differential scales (with a "no information" category) in a questionnaire.
None of the Thurstone scales detected attitude change, but 28% of the
semantic  differential  scales  did.  Test-retest  reliability  was  .53  for  Thurstone, 
.58  for  the  semantic  differential.  In  addition,  the  semantic  differential 
increased  in  preference  as  the  respondents  became  used  to  it. 

Ward (1969), questioning the results of a previous study, found that
the  semantic  differential  is  no  more  vulnerable  to  changes  in  issue  saliency 
than  are  other  widely  used  measures  of  attitude. 

Comparison  of  rating  scale  and  check  list  items.  The  study  by  Hughes 
(1967) referred to immediately above also included adaptations of a check
list (e.g., with important, unimportant, and no opinion categories) on the
questionnaire  employed.  Eleven  percent  of  the  check  list  scales  detected 
attitude  change,  and  the  check  list  items  had  a test-retest  reliability  of 
.58,  the  same  as  the  semantic  differential  items. 

Likert-type  scaling  was  compared  with  the  use  of  various  types  of 
check  lists  by  Kassarjian  and  Nakanishi  (1967),  and  no  differences  in 
results were found. In the Department of the Army study (1952) referred to
previously, the eight-step graphic rating scale was also found to have
higher validity (.53) than when a controlled check list was used (.44).
The four five-step scales had validities ranging from .39 to .44.




Multiple  Choice  Items 


As  used  here,  multiple  choice  items  include  true-false  and  yes-no, 
and  similar  dichotomous  items  as  special  cases.  Generally,  studies  having 
to  do  with  right/wrong  responses  were  excluded  from  the  literature  review, 
unless  they  appeared  to  have  direct  relevance  to  the  use  of  multiple  choice 
items  in  questionnaires.  Comparisons  of  rating  scale  and  multiple  choice 
items  were  reported  above. 

Some  issues  related  to  the  use  of  multiple  choice  items.  A number 
of  issues  appeared  in  the  literature  related  to  the  use  of  multiple  choice 
items.  Those  not  more  appropriately  discussed  in  other  chapters  are 
reviewed  here. 

Swordes  (1952)  discovered  that  in  a test  using  items  with  both  four 
and  five  choices,  a number  of  those  taking  the  test  marked  the  fifth  space 
on  the  answer  sheet  when  the  question  had  only  four  possible  responses. 

It  was  concluded  that  certain  precautions  should  be  taken  to  reduce  the
undesirable  results  of  using  a different  number  of  distractors  in  the  same 
examination.  These  include  special  instructions,  reduction  of  the  number 
of  alternate  groups,  and  restriction  of  a varying  number  of  choices  to  the 
more  capable  test  takers,  when  practicable. 

Mosier  and  Price  (1945)  presented  a means  to  overcome  the  usual  problem
in  multiple  choice  item  construction:  the  arrangement  of  the  response
alternatives.  They  used  a table  in  which  the  120  permutations  of  the
numbers  one  through  five  had  been  randomized.  They  pointed  out,  however, 
that  such  a table  should  not  be  used  when  the  response  alternatives  form 
a logical  pattern. 
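
The kind of randomized permutation table described above can be sketched as follows. This is only an illustration under stated assumptions, not the published table: the function names, the seed, and the sample alternatives are hypothetical, and the alternatives were deliberately chosen so that they do not form a logical pattern.

```python
import itertools
import random


def build_permutation_table(n_choices=5, seed=0):
    """Return every ordering of 1..n_choices, listed in a randomized sequence."""
    table = list(itertools.permutations(range(1, n_choices + 1)))
    # The seed is arbitrary; it only makes the illustration repeatable.
    random.Random(seed).shuffle(table)
    return table


def arrange_alternatives(alternatives, order):
    """Reorder the response alternatives according to one row of the table."""
    return [alternatives[position - 1] for position in order]


table = build_permutation_table()

# Hypothetical alternatives with no natural (logical) order.
item_alternatives = ["rifle", "radio", "boots", "helmet", "canteen"]
shuffled = arrange_alternatives(item_alternatives, table[0])
```

Successive rows of the table would be applied to successive items, so that no position is systematically favored for any one kind of alternative.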

Cronbach  (1941a)  compared  multiple  choice  and  multiple  true-false
tests,  with  instructions  to  guess.  He  found  few  significant
differences  between  them.  However,  the  multiple  choice  type  of  test  had
slightly  higher  reliability  and  seemed  slightly  easier  to  score.  Hence, 
evidence  from  the  study  supported  the  use  of  the  multiple  choice  rather 
than  multiple  true-false  form,  if  omissions  are  not  expected. 

Data  presented  by  Knowles  (1963)  demonstrated  that  questionnaires  of 
the  true-false  type  can  be  differentially  prone  to  acquiescence  response 
set.  This  topic  is  discussed  in  detail  in  Chapter  X. 

It  was  noted  by  Tuckman  and  Lorge  (1953)  that  graduate  students 
experienced  frustration  because  of  the  either/or  choice  when  circling  a 
yes  or  no  response  when  asked  whether  they  generally  agreed  or  disagreed 
with  statements  about  older  people.  Hence,  the  authors  conducted  a study 
where  the  same  questions  were  used  but  where  the  response  was  the  per- 
centage of  older  people  for  whom  the  question  would  apply.  No  significant 
differences  were  found  between  the  two  methods,  causing  the  authors  to 
conclude  that  the  yes-no  method  was  preferred  due  to  its  scoring  ease. 


Comparison  of  multiple  choice  and  forced  choice  or  paired  comparison 
items.  Appel  (1959)  administered  a 72-item  true-false  questionnaire  and
a parallel  forced  choice  questionnaire  consisting  of  24  triads.  The 
content  of  the  items  on  each  form  was  identical.  Based  upon  the  forecasted 
validities  for  the  best  keys  of  the  two  forms  for  an  infinite  number  of
items,  Appel  concluded  that,  for  longer  forms,  the  forced  choice  method  is
likely  to  result  in  greater  validity,  while  for  shorter  forms  the  true- 
false  method  is  likely  to  prove  superior. 

Osburn,  Lubin,  Loeffler,  and  Tye  (1954)  compared  the  relative  validity 
of  forced  choice  and  single  stimulus  yes-no  self-description  items.  The 
contents  of  the  items  that  were  compared  were  identical.  No  significant 
differences  were  found,  but  the  results  seemed  to  suggest  that  the  choice 
of  format  would  depend  upon  the  number  of  items  available  and  their 
statistical  characteristics. 

A forced  choice  form  of  an  interest  inventory  was  compared  with  a
like-indifferent-dislike  form  by  Perry  (1955),  using  the  same  items  for
groups  of  Navy  yeomen  and  college  students.  Unit  weight  and  multiple
weight  keys  were  developed  for  each  inventory  to  differentiate  yeomen  from
students.  The  forced  choice  keys  were  superior  in  separating  groups  in
seven  of  ten  comparisons.  However,  there  was  little  difference  in  validity
shrinkage  for  the  two  kinds  of  items. 

Comparison  of  multiple  choice  items  and  card  sorts.  Van  Der  Veen, 
Howard,  and  Austria  (1970)  compared  response  formats  of  Q-sort,  multiple 
choice,  and  true-false  methods  according  to  test-retest  reliability  and 
scoring  characteristics.  The  analyses  suggested  that  all  three  forms  were 
reliable  in  test-retest  situations.  Both  the  multiple  choice  and  Q-sort 
methods  showed  high  stability.  However,  the  former  showed  some  variance 
for  social  desirability.  The  true-false  method  was  found  psychometrically 
inferior,  showing  lower  stability  and  some  social  desirability  variance. 

The  authors  concluded  that  the  Q-sort  is  the  format  of  choice  if  testing 
time  is  available,  otherwise  the  multiple  choice  format  should  be  used. 

Comparison  of  multiple  choice  and  open-ended  items.  Two  articles 
compared  multiple  choice  and  open-ended  items.  In  the  first,  Rugg  and 
Cantril  (1942)  examined  the  form  of  the  question  in  public  opinion  polls 
by  using  multiple  choice,  dichotomous  choice,  and  free  response  formats. 
Through  five  different  polls  they  reached  the  conclusion  that  in  all  cases 
no  one  method  was  best.  Multiple  choice  gave  accurate  placement,  while 
dichotomous  was  simply  scored.  In  addition,  free  response  gives  respondents 
the  most  freedom  of  expression. 

In  the  second  study,  Gustav  (1964)  compared  responses  to  a questionnaire 
concerning  methods  of  study  and  preferences  for  true-false,  multiple  choice, 
and  essay  questions,  with  actual  test  scores  for  102  undergraduates.  True- 
false  items  were  liked  least.  A large  proportion  of  the  group  reported 
they  studied  differently  for  particular  types  of  examinations,  and  slightly 
more  than  half  believed  they  do  equally  well  on  all  types  of  tests  despite 
any  preferences. 




Forced  Choice  and  Paired  Comparison  Items 


This  section  begins  with  a review  of  some  issues  related  to  forced
choice  and  paired  comparison  questionnaire  items.  Forced  choice  and 
paired  comparison  items  are  next  compared  with  card  sorts  and  then  with 
check  lists.  They  were  compared  with  ranking  items,  rating  scales,  and 
multiple  choice  items  in  earlier  sections  of  this  chapter. 

Some  issues  related  to  forced  choice  and  paired  comparison  items.  In 
a paper  presented  by  the  Personnel  Research  Section,  PRPB,  Adjutant-General's
Office  (1946),  it  was  noted  that  the  utility  of  rating  scales  for  predictive 
purposes  or  administrative  action  had  been  limited  by  the  ease  with  which 
a rater  could  determine  accurately  where  he  was  placing  a person  on  a scale. 

A technique,  the  forced  choice  method,  which  reduces  the  rater’s  ability 
to  control  the  final  result  of  his  ratings,  was  described.  It  was  noted 
that  the  essence  of  the  forced  choice  technique  is  to  force  the  rater  to 
choose  between  descriptive  phrases  which  appear  of  equal  value  (have  the 
same  preference  index)  but  are  different  in  validity  (discrimination  index).

The  major  problem  is  the  grouping  of  alternatives  to  achieve  these  ends. 

The  preference  index  is  the  mean  of  the  scale  indicating  the  degree  to 
which  the  phrase  applies  to  the  group  concerned,  while  the  discrimination 
index  represents  the  correlation  of  the  descriptive  phrase  and  an  overall 
rating. 

As  noted  by  Buel  (1963),  in  the  construction  of  a forced  choice  scale 
the  preference  and  discrimination  indices  are  usually  derived  from  responses 
to  items  in  check  list  form.  The  items  are  then  grouped  on  the  basis  of 
similar  preference  index  but  dissimilar  discrimination  index.  It  is  assumed 
that  the  preference  value  of  an  item  does  not  change  when  it  is  transferred 
from  its  position  in  the  check  list  form  to  a position  in  the  forced  choice 
form.  In  his  study,  Buel  (1963)  found  that,  while  the  preference  index
values  of  only  a few  items  changed,  such  shifts  generally  had  the  effect
of  reducing  the  discrimination  index  values.  Waters  and  Wherry  (1961b) 
also  investigated  the  stability  of  preference  index  values  from  check 
list  to  forced  choice  administration,  and  found  a high  degree  of  stability. 
Berkshire  and  Highland  (1953)  reported  that  a favorableness  index  fits  into 
the  forced  choice  rationale  better  than  does  the  preference  index.  Bartlett 
(1960)  made  comparisons  between  the  two,  and  concluded  that,  if  for  practical
reasons  only  one  index  is  used  for  matching,  the  preference  index  appears 
to  be  the  better. 
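
The two indices and the matching rule described above can be illustrated with a minimal sketch. It assumes that each phrase has been rated for applicability in check list form and that an overall rating exists for each person rated. The data, the function names, and the two matching thresholds are hypothetical and are not taken from any of the studies cited.

```python
import itertools
from statistics import mean


def pearson_r(x, y):
    """Product-moment correlation between two equal-length lists."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def item_indices(applicability, overall):
    """Map each phrase to (preference index, discrimination index)."""
    return {
        phrase: (mean(ratings), pearson_r(ratings, overall))
        for phrase, ratings in applicability.items()
    }


def candidate_pairs(indices, max_pref_gap=0.25, min_disc_gap=0.30):
    """Pair phrases that are similar in preference but dissimilar in discrimination.

    Both thresholds are illustrative assumptions, not published values.
    """
    pairs = []
    for a, b in itertools.combinations(sorted(indices), 2):
        pref_gap = abs(indices[a][0] - indices[b][0])
        disc_gap = abs(indices[a][1] - indices[b][1])
        if pref_gap <= max_pref_gap and disc_gap >= min_disc_gap:
            pairs.append((a, b))
    return pairs


# Hypothetical data: five persons rated, one overall rating each.
overall = [1, 2, 3, 4, 5]
applicability = {
    "resourceful": [1, 2, 4, 5, 5],
    "punctual": [3, 4, 3, 4, 3],
    "careless": [2, 1, 2, 1, 1],
}
indices = item_indices(applicability, overall)
pairs = candidate_pairs(indices)
```

In this made-up example, "punctual" and "resourceful" have the same mean applicability but very different correlations with the overall rating, so they form the only candidate pair.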

Two  studies  that  were  reviewed  considered  the  failure  to  adequately 
match  the  forced  choice  items.  Bartlett  (1960)  used  a scale  where  the 
items  within  each  set  were  not  perfectly  matched  on  preference,  discrimination, 
favorableness,  general  factor  loading,  and  magnitude  of  group  factor  loading. 
Multiple  correlations  indicated  that  about  half  of  the  variance  of  rating 
response  for  both  peer  and  self-ratings  could  be  explained  by  failure  to 
match  on  these  five  indices.  Eisenberg  (1965)  found  significantly  lower 
scores  when  a form  was  used  with  items  not  matched  on  preference  index 
and  different  on  discrimination  index,  compared  to  an  identical  form 
developed  along  classical  lines. 


Zavala  (1965),  in  his  review  of  the  development  of  the  forced 
choice  technique,  pointed  out  that  the  reliabilities  and  validities  of  the 
technique  compare  favorably  with  other  methods,  and  that  studies  have 
shown  that  the  method  is  more  resistant  than  other  scales  to  effects  of 
bias.  Earlier,  however,  Travers  (1951)  conducted  a critical  review  of  the 
validity  and  rationale  of  the  forced  choice  technique  and  noted  that,  as 
used  in  officer  efficiency  reports,  the  evidence  did  not  support  claims 
made  for  the  validity  of  the  procedure.  He  also  concluded  that  the  high 
validity  coefficients  secured  must  be  considered  to  be  largely  spurious 
until  they  are  demonstrated  to  be  otherwise.  As  noted  in  an  earlier  section, 
Scott  (1968)  similarly  concluded  that  the  generalization  that  self-report 
forced  choice  personality  inventories  are  more  valid  than  single  stimulus 
forms  of  the  same  tests  was  not  supported  by  critical  consideration  of 
the  relevant  evidence. 

In  the  study  by  Berkshire  and  Highland  (1953),  six  kinds  of  forced
choice  formats  were  compared  for  rating  Air  Force  technical  instructors 
under  experimental  conditions  and  under  instructions  to  give  as  high  a 
score  as  possible.  The  results  for  the  six  forms  were: 

1.  Form  A:  Two  statements  per  block,  both  favorable  or  both
unfavorable,  choose  the  more  descriptive  or  the  least  descriptive.  Had
relatively  high  reliabilities  and  validities,  was  one  of  the  two  best  liked,
but  was  markedly  unsatisfactory  in  its  failure  to  resist  leniency  effects.
Was  also  uneconomic  in  that  over  half  of  the  blocks  failed  to  discriminate
when  subjected  to  item  analysis.

2.  Form  B:  Three  statements  per  block,  all  favorable  or  unfavorable,
choose  the  most  and  least  descriptive  statements  in  each  block.  Low  in
validity,  lowest  in  reliability,  least  liked  by  the  raters,  and  uneconomic.
Was,  however,  resistant  to  skewing  under  instructions  to  bias.

3.  Form  C:  Four  statements  per  block,  all  favorable,  choose  the
two  most  descriptive  statements.  Most  bias  resistant,  yielded  consistently
high  validities  under  various  conditions,  was  one  of  the  two  best  liked,
and  had  adequate  reliability.  This  method  was  superior  to  the  other  methods
tested.

4.  Form  D:  Four  statements  per  block,  all  favorable,  choose  the  most
and  least  descriptive  statements.  Comparable  to  Form  C in  reliability  and 
validity,  but  was  more  susceptible  to  leniency  effects  and  less  well  liked. 

5.  Form  E:  Four  statements  per  block,  two  favorable  and  two  unfavorable 
in  appearance,  choose  the  most  and  least  descriptive  statements.  An  inadequate 
method,  easily  biased,  low  validity,  and  not  as  well  liked  as  Forms  A,  C,  and  F. 

6.  Form  F:  Five  statements  per  block,  two  of  which  were  favorable,
one  neutral,  and  two  unfavorable  in  appearance,  choose  the  most  and  least
descriptive.  Too  easily  biased  for  use.  Was  moderately  well  liked,  but
was  exceeded  in  validity  by  Forms  A,  C,  and  D.


Agreeing  with  Berkshire  and  Highland,  Zavala  (1965)  also  noted  that 
formats  using  four  favorable  items,  from  which  the  rater  chooses  the  items
most  characteristic  of  the  person  rated,  proved  superior  to  other  formats.
He  said  that  this  superiority  appeared  in  validities,  reliabilities,  and
preferences  of  raters  using  the  form.

In  other  studies  related  to  forced  choice  format,  Zuckerman  (1952) 
found  that  the  like-indifferent-dislike  arrangement  of  self-report  interest
inventories  was  clearly  superior  to  the  two  choice  form.  Waters  and  Wherry
(1961a)  reported  on  the  effect  of  response  format  on  subject  resistance  to
a forced  choice  self-rating  scale.  The  subjects  were  found  to  be  more
favorable  toward  a response  format  allowing  them  to  indicate  the  degree
of  applicability  of  each  statement  in  the  forced  choice  pair,  even  though
they  were  still  forced  to  choose  one  statement  as  relatively  more  applicable.
Waters  (1966)  also  found  that  reaction  to  a forced  choice  scale  was  more
favorable  when  some  method  was  incorporated  whereby  the  subject  was  given  an
opportunity  to  indicate  the  degree  of  applicability  of  each  item  to  himself.

The  effects  of  partial  pairings  were  investigated  in  two  studies.
McCormick  and  Bachus  (1952)  conducted  a study  to  determine  the  extent  to
which  it  would  be  possible,  in  paired  comparison  ratings  of  employees,  to
use  reduced  numbers  of  pairings  and  still  achieve  essentially  the  same
rating  results  as  would  be  obtained  from  a complete  pairing  of  all  individuals
within  a group.  The  results  showed  that  ratings  obtained  from  partial
pairings  resulted  in  fairly  high  correlations  with  ratings  based  on  complete
pairings.  The  correlations  were  reduced  rather  systematically  with  reductions
in  the  number  of  pairs  per  individual  on  which  the  ratings  were  based.  In
a follow-up  article,  McCormick  and  Roberts  (1952)  reported  that  the  reliability
of  ratings  obtained  with  partial  pairings  also  tended  to  decrease  rather
systematically  with  reductions  in  the  number  of  pairs  per  individual  on
which  the  ratings  were  based.  However,  for  groups  of  50  individuals,  ratings
based  on  as  few  as  16  pairs  per  individual  appeared  to  be  relatively  reliable.
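
To give a sense of the savings involved, the sketch below builds one possible partial pairing design for a group of 50 with 16 pairs per individual. The cyclic scheme used here (each person is paired with the next eight people around a circle) is an assumption made for illustration; the source does not say how the reduced sets of pairs were actually chosen, and the function name is hypothetical.

```python
from collections import Counter


def partial_pairs(n_people, pairs_per_person):
    """Build a cyclic partial pairing in which everyone appears in the same number of pairs."""
    if pairs_per_person % 2 or not 0 < pairs_per_person < n_people:
        raise ValueError("pairs_per_person must be even and smaller than n_people")
    half = pairs_per_person // 2
    pairs = set()
    for i in range(n_people):
        # Pair person i with the next `half` people around the circle;
        # the previous `half` people supply the other half of i's pairs.
        for step in range(1, half + 1):
            j = (i + step) % n_people
            pairs.add((min(i, j), max(i, j)))
    return sorted(pairs)


design = partial_pairs(50, 16)
appearances = Counter(person for pair in design for person in pair)
complete_pairing_size = 50 * 49 // 2
```

Under this scheme the rater makes 400 comparisons instead of the 1,225 required by a complete pairing of 50 individuals.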

As  noted  above,  Zavala  (1965)  reported  that  studies  on  the  forced
choice  method  showed  it  to  be  more  resistant  than  other  scales  to  effects
of  bias.  As  will  be  discussed  in  Chapter  X,  the  forced  choice  method
has  been  used  by  a number  of  investigators  in  an  attempt  to  control
the  tendency  of  individuals  to  answer  self-report  inventories  in  terms  of
response  sets  rather  than  giving  "true"  responses.  For  example,  Jackson
and  Minton  (1963)  concluded  that  combining  items  into  scales  and  casting
them  into  a paired  comparison  context  is  the  method  of  choice  in  constructing
adjective  check  lists  for  personality  assessment.  This  conclusion  was
based  upon  the  effects  of  the  forced  choice  format  in  enhancing  content
reliability  and  eliminating  the  massive  response  set  to  check  many  or
few  items  on  the  check  list.  Howe  (1960),  however,  working  with  anxiety
scales,  reported  that  data  concerning  reliability  and  skewness  did  not
give  an  unequivocal  impression  that  the  forced  choice  format  reduces  the
tendency  to  give  socially  desirable  responses.  Feldman  and  Corah  (1960)
also  reported  that  social  desirability  is  not  minimized  by  the  forced
choice  format.  Braun  (1969)  pointed  out  that  there  is  no  effective  control
for  social  desirability  of  alternatives  presented,  nor  for  fake-proof
devices.  Lederman  (1971)  interpreted  data  as  showing  that  the  forced
choice  format  cannot  prevent  subjects  from  presenting  a more  favorable 
image  of  themselves  if  they  choose  to  do  so,  but  that  the  problem  is  usually 
less  in  the  forced  choice  than  in  the  questionnaire  format. 

Studies  of  the  results  of  asking  subjects  to  fake  their  responses 
have  been  conducted  by  a number  of  authors,  including  Izard  and  Rosenberg
(1958)  and  Eisenberg  (1965).  Izard  and  Rosenberg  used  a forced  choice 
personality  test  with  naval  aviation  cadets.  They  found  that  forced 
choice  scores  under  instructions  for  a "set  to  fake"  did  not  significantly 
differ  from  regular  scores,  suggesting  that  the  test  is  not  easily  susceptible
to  faking.  Eisenberg  (1965),  however,  found  that  instructions  to  fake
did  affect  results  when  a forced  choice  format  developed  in  the  classical 
manner  was  used. 

Comparison  of  forced  choice  items  and  card  sorts.  In  a study  by  Turgut 
(1963),  a paired  comparison  format  and  a modified  Q-sort  were  compared  for 
efficiency  in  personality  measurement.  The  experiment  tested  reliabilities 
per  unit  of  testing  time  and  acceptability  to  the  examinees.  The  internal 
consistency  coefficients  of  the  paired  comparison  format  averaged  .77.  The 
average  for  the  Q-sort  was  .73  when  corrected  for  the  average  time  spent. 
Subjects'  reactions  to  the  formats  were  measured  by  a rating  scale,  and
showed  that  57%  liked  the  paired  comparison  form  and  32%  liked  the  Q-sort.

Comparison  of  forced  choice  items  and  check  lists.  Forced  choice  or 
paired  comparison  items  were  compared  with  check  lists  in  two  studies. 

In  the  U.S.  Department  of  Army  study  (1952)  previously  referred  to,  a 
controlled  check  list  with  24  items  where  the  12  most  descriptive  were  to 
be  selected  was  used  in  addition  to  forced  choice  pairs.  The  validity
of  the  forced  choice  pairs,  based  on  rankings  by  approximately  20  classmates,
was  .41;  of  the  controlled  check  list,  .44.

Merenda  and  Clarke  (1963)  compared  two  self-rating  adjective  check 
lists.  The  first  was  the  regular  free  response  list,  the  second  a forced 
choice  version  where  the  adjectives  were  arranged  in  tetrad  sets.  Ipsative 
scoring  (discussed  in  Chapter  XI)  was  used.  The  results  suggested  that 
the  forced  choice  method  is  likely  to  be  inappropriate  for  use  with  adjective 
check  lists  in  self-concept  assessment. 


Card  Sorts 


The  advantages  of  using  card  sorts  for  acquiring  rating  information
on  any  issue  have  been  discussed  by  a number  of  authors,  including  Dubois
(1949-50).  The  most  extensive  discussion  of  the  use  of  card  sorts  (or,  more 
generally,  Q-technique  and  its  methodology)  probably  appears  in  The  Study 
of  Behavior  by  William  Stephenson  (1953).  Card  sorting  as  a technique  for 
survey  interviewing  was  discussed  by  Cataldo,  Johnson,  and  Kellstedt  (1970),
who  assessed  its  reliability,  validity,  and  response  bias  and  the  reactions
of  respondents  and  interviewers.  They  concluded  that  card  sorting  is  a 
fast  and  interesting  method  of  obtaining  valid  and  reliable  interview  data, 
and  one  which  appears  to  be  capable  as  well  of  counteracting  at  least  some 
of  the  biasing  effects  of  response  set. 

Four  articles  were  abstracted  that  addressed  the  issue  of  whether  Q- 
sorting  procedures  should  allow  a free  or  unforced  sort  where  the  subject 
is  allowed  to  place  as  many  cards  as  desired  within  the  sorting  intervals, 
or  require  a forced  sort  where  a predetermined  number  of  items  have  to 
be  placed  in  each  interval  cell.  Block  (1956)  compared  forced  and  unforced 
Q-sorting  procedures  using  76  items  with  11  sorters.  The  forced  sort  seemed 
to  offer  more  stability  and  slightly  more  discrimination  than  unforced 
sorting.  Gaito  (1962),  considering  statistical  and  non-statistical  aspects 
of  Q-sorting,  concluded  that  severe  defects  appeared  present  for  various 
analysis  tests  of  significance  when  forced  sorting  was  involved;  moderate 
distortion  when  the  free  sort  was  used.  Hess  and  Hink  (1959)  also  compared 
the  forced  and  free  Q-sort  procedures,  and  found  that  the  two  types  of 
administration  gave  similar  results  when  the  identical  Q-sort  was  used  with 
adolescents.  A similar  conclusion  was  reached  by  Brown  (1971).  He  noted 
that  arguments  favoring  free  over  forced  Q-sorts  had  assumed  that  forcing 
leads  to  the  loss  of  important  statistical  information  and  interferes  with
interval  properties,  rendering  Pearson's  r inappropriate  for  analysis.  He
found  that  Q-sorts  with  identical  item  orderings  but  with  varied  distributions
provided  essentially  the  same  correlations  and  factor  structures
when  coefficients  were  computed  using  Spearman's  rs,  Kendall's  tau,  and
Pearson's  r.  Hence,  he  concluded  that  the  same  results  are  obtained  regardless
of  distribution  and  whether  interval  or  ordinal  statistics  are  used.
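
The comparison of interval and ordinal coefficients can be illustrated with a small sketch. It scores the same hypothetical ordering of nine items twice, once under a forced quasi-normal distribution and once under a flat distribution, and then computes Pearson's r and Spearman's rank coefficient between the two sets of scores. The data and function names are assumptions for illustration only, and Kendall's coefficient is omitted to keep the sketch short.

```python
from statistics import mean


def pearson_r(x, y):
    """Product-moment correlation between two equal-length lists."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def average_ranks(values):
    """Rank values from 1 upward, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        tied_rank = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = tied_rank
        start = end + 1
    return ranks


def spearman_rs(x, y):
    """Spearman's coefficient computed as Pearson's r on average ranks."""
    return pearson_r(average_ranks(x), average_ranks(y))


# Hypothetical: the same item ordering scored under two sorting distributions.
forced_scores = [-2, -1, -1, 0, 0, 0, 1, 1, 2]  # forced quasi-normal sort
flat_scores = [1, 2, 3, 4, 5, 6, 7, 8, 9]       # one item per interval

r_interval = pearson_r(forced_scores, flat_scores)
r_ordinal = spearman_rs(forced_scores, flat_scores)
```

With these made-up scores both coefficients come out above .95 and within .02 of each other, which is the kind of agreement the paragraph above describes.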

In  previous  sections  of  this  chapter,  card  sorts  were  compared  with
rating  scales,  multiple  choice  items,  and  forced  choice  items. 


Semantic  Differential  Items 


This  section  reviews  some  of  the  pros  and  cons  about  the  use  of  the
semantic  differential,  and  presents  only  a few  of  the  many  articles  on 
the  technique.  The  first  major  paper  on  the  semantic  differential  was 
by  Osgood  (1952),  in  which  the  development  of  the  technique  as  a general 
method  of  measuring  meaning  was  described.  It  involved:  the  use  of  factor
analysis  to  determine  the  number  and  nature  of  factors  entering  into
semantic  description  and  judgment;  and  the  selection  of  a set  of  specific 
scales  corresponding  to  these  factors  which  can  be  standardized  as  a measure 
of  meaning.  Using  this  differential,  the  meaning  of  a particular  concept 
to  a particular  individual  can  be  specified  quantitatively.  The  classical 
book  on  the  semantic  differential  was  written  by  Osgood,  Suci,  and  Tannenbaum
(1957). 

Two  studies  that  were  reviewed  investigated  the  reliability  of  the 
semantic  differential.  Jenkins,  Russell,  and  Suci  (1958)  had  360  words
rated  on  20  scales  by  18  groups  of  30  subjects.  Profiles  of  mean  scale 
values  for  each  concept  were  prepared.  The  reliability  of  these  scale
values  was  found  to  be  .97,  and  mean  scale  values  correlated  .97  with
median  scale  values.  Miron  (1961)  investigated  the  influence  of  instructions
upon  the  test-retest  reliabilities  of  the  semantic  differential,
and  found  the  correlations  ranged  from  .996  to  .857.  The  basic  measure 
used  in  the  experiment  was  the  absolute  deviation  between  mean  concept 
scores  on  each  of  five  factor  scores  summarizing  a given  set  of  scales. 
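
A profile of mean scale values, and the corresponding profile of medians, can be computed as in the sketch below. The ratings are hypothetical (three subjects rating one concept on four seven-point scales), and the function name is illustrative rather than anything used in the studies cited.

```python
from statistics import mean, median


def scale_profiles(ratings):
    """Return (mean profile, median profile) for one concept.

    ratings holds one list per subject, each giving that subject's
    rating of the concept on every scale, in the same scale order.
    """
    by_scale = list(zip(*ratings))
    return [mean(s) for s in by_scale], [median(s) for s in by_scale]


# Hypothetical ratings of a single concept.
ratings = [
    [6, 2, 5, 4],
    [7, 1, 5, 3],
    [5, 3, 4, 5],
]
mean_profile, median_profile = scale_profiles(ratings)
```

Correlating the two profiles across many concepts would show how closely means and medians agree.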

Two  reports  of  the  validity  of  the  semantic  differential  were  reviewed, 
both  in  the  marketing  area.  Agostini  (1962)  reported  evidence  on  the 
validity  of  the  technique  as  an  indicator  of  brand  attitude  as  measured 
by  purchase  behavior.  Significantly  higher  brand  average  attitude  scores 
were  found  among  users  of  two  brands  of  a food  product  than  among  nonusers, 
thus  illustrating  the  validity  of  the  semantic  differential  for  this  use. 
Barclay  (1964)  also  found  that  the  semantic  differential,  in  the  form  used, 
was  a valid  indicator  of  brand  attitudes  as  inferred  from  purchasing 
behavior.  However,  the  differential  as  used  was  found  not  to  be  a very 
sensitive  measure. 

Proximity  error  in  administering  the  semantic  differential  was 
studied  by  Kane  (1968).  Proximity  error  occurs  when,  due  to  the  ordering 
or  polarity  of  the  semantic  differential  scales,  one  answer  results  in 
another  answer  to  a subsequent  question  being  substantially  changed  from 
what  it  would  otherwise  be.  He  investigated  effects  due  to  the  order  of 
concept  presentation,  of  adjective  presentation,  and  of  order  of  adjectives 
within  a particular  scale.  He  found  no  significant  differences  in  response 
traceable  to  questionnaire  format  manipulations,  showing  that  proximity 
error  was  not  a problem  with  semantic  differential  questionnaires. 

Worthy  (1969)  noted  that  semantic  differential  rating  scores  are  often 
reported  as  an  extreme  response  measure  which  ignores  the  middle  or  neutral 
categories  as  a response.  He  reanalyzed  data  and  concluded  that  those  who 
tended  to  make  extreme  responses  also  tended  to  make  midpoint  responses. 

The  implication  for  scoring  was  not  to  make  the  assumption  that  a midpoint
response  is  totally  lacking  in  extremeness  since  it  is  a demonstrative
response.  A related  concern  has  been  whether  or  not  the  semantic  differential
measures  both  the  intensity  and  direction  of  attitude.  Mehling  (1959)
plotted  subjects'  ratings  on  an  intensity  scale  against  responses  to  related
semantic  differential  scales,  and  concluded  that  as  used  in  the  study  the
semantic  differential  did  measure  both  the  direction  and  intensity  of
attitude.  Bentler  (1969)  also  found  that  semantic  space  is  approximately
bipolar,  while  Carter,  Ruggels,  and  Chaffee  (1968)  found  that  subjects  can 
more  accurately  denote  their  descriptions  to  objects  when  one  end  of  the 
scale  is  left  for  them  to  describe. 

Semantic  differential  scales  were  compared  with  check  lists  by  Block 
(1958)  and  Hughes  (1967).  Block  found  a correlation  of  .94,  after  correction 
for  attenuation,  between  semantic  differential  descriptions  and  adjective 
check  list  descriptions  of  the  ideal  self  and  the  like-sex  parent.  He
concluded  that  the  semantic  differential  may  be  a rather  complicated  way 
of  developing  a measure  that  is  more  readily  and  reliably  secured  by  other
means.  Hughes  (1967)  reported  that  28%  of  the  semantic  differential
scales  he  used  detected  attitude  change,  while  only  11%  of  the  check  list
scales  did.  Both  showed  the  same  test-retest  reliability,  however  (.58).
Preference  for  the  semantic  differential  increased  from  11%  to  34%  as  the
respondents  became  familiar  with  it,  while  preference  for  the  check  list
declined  from  57%  to  40%.
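
The correction for attenuation mentioned in connection with Block's correlation of .94 divides an observed correlation by the square root of the product of the two measures' reliabilities. The sketch below applies that standard formula to hypothetical figures. Neither the observed correlation nor the reliabilities in Block's study are given here, so the numbers are illustrative only.

```python
def correct_for_attenuation(r_xy, r_xx, r_yy):
    """Estimate the correlation between two measures if both were perfectly reliable.

    r_xy is the observed correlation; r_xx and r_yy are the reliabilities
    of the two measures.
    """
    if not (0 < r_xx <= 1 and 0 < r_yy <= 1):
        raise ValueError("reliabilities must lie in (0, 1]")
    return r_xy / (r_xx * r_yy) ** 0.5


# Hypothetical values, chosen only to show the size of the adjustment.
corrected = correct_for_attenuation(r_xy=0.60, r_xx=0.80, r_yy=0.50)
```

The lower the reliabilities, the larger the upward adjustment, which is why a corrected coefficient should be read together with the reliabilities on which it rests.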

Comparisons  of  the  semantic  differential  with  other  types  of  rating 
scales  appear  in  an  earlier  section  of  this  chapter. 


Other  Types  of  Items 

The  types  of  items  to  be  considered  in  this  section  are  projective 
items,  open-ended  items,  check  lists,  rearrangement  items,  and  matching 
items.  Comparisons  of  check  list  items  and  rating  scale,  forced  choice, 
and  semantic  differential  items  were  discussed  in  previous  sections  of 
this  chapter. 

Projective  items.  The  use  of  projective  items  was  not  a high  priority 
topic  for  this  report,  but  three  reviewed  documents  discussed  them.  In  the 
study  of  attitudes  of  troops  in  the  tropics  authored  by  Hart,  Faust,  Rowland, 
and  Lucier  (1964),  complementary  objective  and  projective  techniques  were 
compared  and  contrasted  for  their  efficacy  in  assessing  attitudes  towards 
items  of  QM  issue  and  situations  relating  to  tropical  military  service. 

They  found  that  projective  and  unstructured  data  collection  techniques 
provided  attitudinal  data  not  captured  by  the  more  structured  techniques. 

They  also  found  that  responses  to  objective  items  correlated  significantly 
with  sentence  completion  items  on  the  same  topic.  Thematic  stimuli  provided 
in  a projective  pictures-written  response  technique  were,  however,  inadequate
for  eliciting  the  appropriate  topic  related  attitudinal  responses.
A color  response  technique  did  not  indicate  any  relationships  with  other
techniques.  Nevertheless,  the  authors  recommended  that  a combination  of
highly  structured,  semistructured,  and  unstructured  techniques  be  employed
in  a complex  measurement  setting,  as  is  typical  in  the  case  of  attitude
measurement.

In  the  marketing  area,  Haire  (1950)  found  that  projective  methods
may  aid  in  determining  respondents'  motivations  toward  a stimulus  in  linking
attitudes  and  behavior.  Steele  (1964)  investigated  the  validity  of  projective
questions,  and  concluded  that  the  projective  technique  is  a useful
device  where  inhibitions  may  be  raised  in  an  interview.

Open-ended  items.  A comparison  of  open-  and  close-ended  questionnaire 
items  was  presented  in  Chapter  II,  while  the  use  of  open-ended  items  to 
determine  questionnaire  content  was  discussed  in  the  first  section  of  this 
chapter.

On  other  relevant  topics,  Roslow,  Wulfeck,  and  Corby  (1940)  noted 
that  results  from  free  response  questions  may  be  misleading  when  the 
memory  of  the  respondent  and/or  familiarity  with  possible  responses 
operates  to  any  appreciable  extent.  Payne  (1965)  cites  a meaningful  role
for  open-ended  questions  in  preliminary  phases  of  research  in  areas  such
as  the  development  of  categorical,  checkbox  questions,  to  eliminate  the 
need  for  asking  reason-why  questions  of  every  respondent,  or  to  provide 
quotes  which  may  add  interest  to  a report.  Frisbie  and  Sudman  (1968) 
found  a direct  relation  between  the  amount  of  speech  in  open-ended  questions 
and  positive  and  negative  feelings.  People  with  high  positive  or  negative 
feelings  talked  more  than  those  with  low  positive  or  negative  feelings. 

In  both  cases,  those  classified  as  having  high  feelings  had  one  more  sentence 
than  those  classified  as  low. 

Comparison of open-ended items and check lists. Two studies compared
the use of open-ended questions with check lists. Scates and Yeomans (1950b)
studied  the  effect  of  question  form  on  the  course  requests  that  were  received 
from  adults  employed  in  scientific  and  engineering  fields.  They  found  that 
questionnaires  involving  depth  essay  questions  were  returned  by  a smaller 
proportion  of  persons,  but  the  requests  which  they  contained  were  believed 
to  be  more  firmly  based.  A course  check  list  elicited  a larger  number  of 
course  requests  per  employee  who  returned  it  than  did  questions  which  asked 
the  employee  to  think  of  the  courses  he  may  desire. 

The check list and open response methods of survey research were
compared by Belson and Duncan (1962) with respect to yesterday's reading
of newspapers and magazines and with respect to yesterday's TV viewing.
Results indicated that offering items in the form of a check list produced
an appreciably higher rate of claim that publications were looked at.
However, the check list was found to depress the enumeration of items placed
under its "other" category. The open response system produced only 73% of
the volume of endorsements produced by the check list, but it gave 1.72
times as many as the "other" category.

Other  topics  regarding  the  use  of  check  lists.  Roslow,  Wulfeck,  and 
Corby  (1940)  found  that  the  proportions  obtained  by  alternatives  in  check 
list  questions  tended  to  be  influenced  by  the  number  and  completeness  of 
the  alternatives  presented.  And  Lindzey  and  Guest  (1951)  found  that  omissions 
of  popular  items  from  check  lists  produced  substantial  changes  in  response 
distribution.  They  also  found  that  few  respondents  used  the  "other-write  in" 
category. 

McCormick  (1960)  studied  the  effect  that  the  number  of  questions  asked 
about  each  task  had  on  the  consistency  and  amount  of  information  provided 
by  Air  Force  personnel  when  completing  task  inventories.  No  systematic 
differences  were  found  in  the  number  of  tasks  reported  by  incumbents  who 
were  asked  to  report  one,  two,  three,  or  four  types  of  information  about 
such  tasks.  The  requirement  to  report  more  types  of  information  generally 
provided  more  reliable  information. 

Rearrangement items. Sims (1934) examined the use of rearrangement
tests of ability as an alternative to objective tests. He found that as
the length of the rearrangement set increased from five to 15 items, the
reliability also increased. At some point before 30 items, however, the
length of the test seemed merely to reflect the student's intelligence.
He concluded that this type of test compares favorably with other types of
objective tests in reliability, time for taking, and scoring time, when
the desire is to measure the ability to relate items to some designated
basis.

Articles more pertinent to the use of rearrangement items in
questionnaires were not located during the literature search which preceded
this review.

Matching  items.  The  literature  review  uncovered  only  one  article  on 
the topic of item matching. Follman, Urbanke, and Burley (1971) compared
three  item  matching  objective  test  formats  using  60  undergraduate  students 
who  were  asked  to  match  definitions  to  20  appropriate  verbs  typically  used 
in  essay  type  questions.  The  results  indicated  that  better  scores  were 
obtained  by  ordering  the  items  randomly  but  dividing  them  into  small  groups 
of  three  to  six  items. 

Conclusions  Regarding  the  Pros  and  Cons  of  Various  Types  of  Questionnaire 
Items 

Ranking  items.  Based  upon  five  studies,  it  would  appear  that  ranking 
and  rating  techniques  are  generally  comparable.  There  is  some  evidence, 
however,  that  conclusions  based  upon  a single  judge  differ  from  those  based 
upon  multiple  judges.  More  research  appears  to  be  needed  in  this  area, 
especially  studies  designed  so  that  the  items  to  be  ranked  or  rated  are 
as  comparable  as  possible. 

Contradictory  evidence  was  obtained  regarding  the  comparison  of  ranking 
and  paired  comparisons.  The  bulk  of  the  evidence,  however,  seems  to  support 
the  notion  that  the  two  techniques  produce  comparable  results.  In  two  studies 
a linear  relationship  was  found  between  the  results  obtained  from  the  two. 
Several  investigators  noted  that,  if  the  results  are  comparable,  ranking  is 
to  be  preferred  to  paired  comparisons  since  it  takes  less  time. 
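
The time advantage of ranking can be made concrete with a small sketch. It
assumes a complete paired comparison design, in which every pair of n items
is judged once, so that a judge makes n(n-1)/2 choices where ranking requires
a single ordering of the n items. The function names and the example items
are illustrative only and do not come from the studies reviewed.

```python
from itertools import combinations


def paired_comparison_count(n_items: int) -> int:
    """Number of judgments in a complete paired comparison design."""
    return n_items * (n_items - 1) // 2


def rank_from_pairs(items, prefer):
    """Order items by how often each one wins its paired comparisons.

    `prefer(a, b)` stands in for a hypothetical judge: it returns
    whichever of the two items is preferred. Items with equal win
    counts keep their input order.
    """
    wins = {item: 0 for item in items}
    for a, b in combinations(items, 2):
        wins[prefer(a, b)] += 1
    return sorted(items, key=lambda item: -wins[item])


if __name__ == "__main__":
    for n in (5, 10, 20):
        print(n, "items:", paired_comparison_count(n), "paired judgments")

    # A perfectly consistent (and purely illustrative) judge who always
    # prefers the longer label.
    items = ["boots", "poncho", "canteen", "helmet liner"]
    print(rank_from_pairs(items, lambda a, b: max(a, b, key=len)))
```

With a consistent judge the win counts reproduce a single rank order, which
is one way of seeing why the two techniques can yield comparable results
while differing sharply in the number of judgments required.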

Rating scale items. A majority of the studies reviewed found that
results obtained from the use of rating scales were comparable to those
obtained when forced choice or paired comparison items were employed. This
is surprising in terms of the known properties and limitations of the
ipsative scores produced by forced choice devices, as discussed in
Chapter XI. However, Scott (1968) noted that generalizations about forced
choice and single stimulus tests are open to question since most studies
did not use identical items in the two forms compared. If results from the
two are comparable, then rating scales might be preferred since they are
more efficient and take less testing time.

Results  comparing  rating  scales  and  card  sorts  are  inconclusive.  Only 
two  studies  were  found  in  this  area.  Results  are  similarly  inconclusive 
in the comparison of rating scales and semantic differential items and
rating  scales  and  check  lists,  as  few  studies  were  located. 




Multiple  choice  items.  Comparisons  between  multiple  choice  items  and 
true-false  items  (which  are  a special  type  of  multiple  choice  item)  are 
discussed  in  Chapter  VI.  The  three  studies  that  reported  comparisons  of 
multiple  choice  and  forced  choice  items  did  not  come  to  the  same  conclusion, 
although  there  was  some  tendency  to  feel  that  a choice  of  format  would  depend 
upon  the  number  of  items  available  and  their  statistical  characteristics. 

In the one study located that compared multiple choice items and card sorts
(Van Der Veen, Howard, & Austria, 1970), it was concluded that the true-false
method was psychometrically inferior, and the Q-sort should be used in
preference to the multiple choice format if adequate testing time is available.
There is also little on which to base a conclusion regarding multiple choice
and open-ended items. Each might have its place, depending upon the purposes
and objectives of any given study.

Forced  choice  and  paired  comparison  items.  Some  issues  related  to  forced 
choice  and  paired  comparison  items  were  reviewed.  Based  upon  two  studies, 
problems  seem  to  arise  when  the  alternatives  within  a forced  choice  item  are 
not  adequately  matched.  Although  one  investigator  (Zavala,  1965)  pointed 
out  that  the  forced  choice  technique  compared  favorably  with  other  methods 
in  terms  of  reliability  and  validity,  two  others  (Travers,  1951;  Scott,  1968) 
felt  that  some  of  the  claims  made  were  not  supported  by  the  evidence.  A 
more  critical  and  detailed  review  of  the  studies  conducted  in  this  area  is 
probably  in  order. 

Two  investigations  (Berkshire  & Highland,  1953;  Zavala,  1965)  lead 
to  the  conclusion  that  the  best  format  for  forced  choice  items  (at  least 
when  used  for  personnel  ratings)  is  four  statements  per  block,  all  favorable, 
where the two most descriptive statements are to be chosen. Zuckerman
(1952), however, found three statements preferable to two. In terms of
what  was  noted  regarding  the  need  to  adequately  match  items,  more  research 
on  this  issue  would  probably  be  worthwhile. 

There  still  seems  to  be  some  question  as  to  the  extent  that  forced 
choice  items  can  be  used  to  reduce  undesirable  response  sets,  at  least 
in  terms  of  the  articles  included  in  this  review.  Since  this  was  not  a 
topic  of  great  stress  during  the  literature  review  because  many  of  the 
articles  are  in  the  personality  area,  a more  intensive  literature  review 
on  the  use  of  forced  choice  items  to  control  response  set  might  be  in  order. 

Only  one  study  was  found  comparing  forced  choice  items  and  card  sorts, 
while  only  two  were  located  comparing  forced  choice  items  and  check  lists. 
Conclusions  about  them  would  not  appear  warranted. 

Card sorts. A majority of the articles about card sorts addressed
the question of whether or not forced or unforced sorts should be used.
The conclusion appears to be that when the same items are used, it does
not make much difference which system is employed.


Semantic differential items. A number of investigators advocate
the use of the semantic differential, and its reliability and
validity seem to have been established in the articles reviewed. Block
(1958),  however,  questioned  whether  the  semantic  differential  may  be  a 
rather  complicated  way  of  developing  a measure  that  is  more  readily  and 
reliably  secured  by  other  means.  Since  the  technique  was  initially 
developed  as  a general  method  of  measuring  meaning,  a more  extensive 
literature  review  regarding  its  use  in  questionnaires  might  be  in  order. 

Other types of items. Projective items, open-ended items, check lists,
rearrangement items, and matching items were also discussed above. Since
there were few studies about these types of items, conclusions do not
appear warranted.




Chapter  IV 


COMPARISON  OF  SCALING  TECHNIQUES 


Once  a selection  has  been  made  of  the  kinds  of  items  that  are  to  be 
used  in  a questionnaire,  it  may  be  necessary  to  determine  scale  values  for 
the  items.  Chapter  IV  is  addressed,  therefore,  to  a comparison  of  scaling 
techniques. The literature review made in conjunction with preparing
this report, however, did not stress articles on scaling techniques,
although some were uncovered. Since the topic was not stressed and many
other articles could be located, a discussion similar to those included
in other chapters does not appear warranted. Instead, the articles for
which abstracts were available from the literature search are listed
below with a short annotation regarding their content. Comparisons of
psychological  scaling  techniques  with  other  types  of  questionnaire  items 
are  discussed  in  Chapter  III  in  those  sections  pertaining  to  rating  scales. 

Ballin  and  Farnsworth  (1941)  developed  a graphic  rating  method  that  had 
scale  values  which  agreed  closely  with  scale  values  obtained  using  the 
Seashore-Hevner method and the method of equal appearing intervals.

Banta  (1961)  used  the  methods  of  Likert,  Thurstone,  and  a newly  developed 
method  of  Unfold  Partial  Rank  Order  to  measure  attitudes  towards  each  of 
three  referents  differing  in  ambiguity.  The  scores  obtained  from  all  three 
methods  correlated  equally  well  at  each  level  of  referent  ambiguity. 

Barclay  and  Weaver  (1962)  found  that  the  construction  of  a Thurstone  scale 
took  43.2%  more  time  than  the  construction  of  a Likert  scale  with  the 
same  number  of  items,  and  that  the  Likert  scale  was  more  reliable. 

Bartlett, Heermann, and Rettig (1960) compared the magnetic board rating
technique to the paired comparison, ranking, Likert, graphic rating, and
equal appearing intervals methods. It was concluded that all six scaling
techniques were equally accurate measures of scale value.

Clark and Kriedt (1948) applied Guttman's scaling techniques to the
Rundquist-Sletto attitude scale. The scale did not meet Guttman's
criterion for adequate scale unidimensionality despite the fact that the
internal consistency of the scale was high. Thus the authors concluded that
Guttman's method of scale analysis may have serious limitations in the
area of general attitude measurement.

Coombs  (1950)  developed  the  ordered  metric  scale  which  is  based  on  the 
order  of  magnitude  of  the  interval  between  objects. 

Edwards (1946b) concluded that Likert scales tended to have higher
reliability than Thurstone scales and were easier to construct.




Edwards  (1948)  refined  Guttman's  technique  for  determining  cutting  points 
by  assuming  perfect  reproducibility  and  making  predictions  of  item 
responses  on  this  assumption. 

Edwards  (1951)  discussed  the  use  of  the  method  of  successive  intervals, 
which  is  a psychological  scaling  procedure  in  which  stimuli  are  classified 
into successive intervals according to the degree of some defined
attribute which they are judged to possess.

Edwards (1956) concluded that using the method of paired comparisons in
conjunction with a set of opinion statements with known scale values had
promise for the construction of attitude scales with a relatively high
degree of reproducibility and satisfactory reliability.

Edwards and Kenny (1946) established the fact that it is possible to
construct scales by the Likert and Thurstone methods which will yield
comparable scores.

Edwards and Kilpatrick (1948b) described the Scale-Discrimination method
which  makes  use  of  Thurstone  scaling  procedures,  retains  Likert's  process 
for  evaluating  the  discriminatory  power  of  the  individual  items,  and  meets 
the  requirements  of  Guttman's  Scale  Analysis. 

Eysenck and Crown (1949) handled the results of a study by: determining
reliabilities under various systems of scoring (Thurstone, Likert and Scale
Product); factorial analysis; Guttman Scalogram analysis; plotting of
scale positions of items against number of endorsements, percentage
reproducibility, and factor saturations; and determining neutral point by
different methods.

Farnsworth  (1945a)  found  that  scale  weights  obtained  for  items  using  a 
technique  modified  from  Allport  approximated  weights  obtained  by  Thurstone 
with  sorting.  The  modified  technique  was  a method  where  extreme  items 
were  put  at  the  opposite  ends  of  a series  of  equilength  lines  representing 
the  individual  items  and  where  the  subjects,  in  a group  situation,  checked 
relative  item  value. 

Farnsworth (1945b), as the result of a study in which judges of statements
were asked their understanding of the distance between degrees of the
scale, questioned the use of equal appearing interval scales.

Federico (1971a) studied Likert and Guttman-type questionnaire forms. He
found  that  Air  Force  students  demonstrated  significantly  more  favorable 
attitudes  toward  analogous  content  areas  on  the  Guttman-structured  items 
than  on  the  Likert-structured  items.  Evidently,  item  formatting  did  affect 
the  degree  of  the  evaluative  assertions  ascribed  to  the  attitude  universe. 

Ferguson (1939b) suggested the following requirements for attitudinal
scales: scale results correspond to underlying physical order; scale
values selected not affected by other items in scale; attitudes of judges
of responses do not affect scale values; specific in content; validity;
reliability; and scale on a linear continuum.




Ford  (1950)  illustrated  a rapid  method  for  determining  whether  a set  of 
six  (or  fewer)  attitude  questions  form  a scale. 

Gardner  (1950)  suggested  a technique  for  obtaining  an  interval  scale  not 
dependent  on  an  assumed  normal  distribution  or  the  selection  of  one  given 
population.  The  units  of  this  scale  are  called  K-units. 

Guilford  (1928)  presented  a method  for  getting  scale  values  which  are 
assumed  to  be  on  an  "objective"  continuum.  These  scale  values  are  in 
terms  of  sigma  units  from  an  assumed  mean  of  all  the  stimulus  values. 

Guttman (1947a) described the Cornell technique, which is primarily the
combining of data to produce cutting points that minimize error of
reproducibility.

Gulliksen & Messick (19b9) included in their book discussions on: the method
of successive intervals; quantitative judgment scales; similarity of stimuli;
metric properties of behavioral data; the method of successive categories;
ratio scales; partition scales; confusion scales; and multidimensional
unfolding.

Hughes  (1967)  compared  the  Thurstone  scale,  a modification  of  the  semantic 
differential,  and  the  check  list  scales  for  their  ability  to  detect  changes 
in attitudes, their test-retest reliability, and their acceptance among
respondents.

Jahn (1951) extended scale analysis along three lines: one, to include
alternative methods for reduction of a set of attributes to a single
quantitatively defined variable; two, to include methods for the reduction
of a set of attributes to a single qualitatively defined variable or
qualitative types; and three, the development of statistical-experimental
tests to decide whether the theorems of scale analysis are to be accepted
for application to a given empirically defined set of attributes.

Kelley,  Hovland,  Schwartz  and  Abelson  (1955)  found  that  data  analysis  using 
the  method  of  equal  appearing  intervals  did  not  discriminate  judges  with 
extreme  views. 

Komorita (1963) demonstrated that a neutral region could be determined for
Likert scores but, because of the quasi-scale characteristic of the
instrument, no neutral point could be clearly delineated. Weighting content
scores by intensity as in the Likert method, instead of using simple
zero-one weights, had negligible effects on total score. However, if the
number of items is small, there seemed to be some advantage in the Likert
method.

Kriedt  and  Clark  (1949)  concluded  that  the  Cornell  Technique  of  Scale 
Analysis  (Guttman)  can  prove  to  be  very  useful  in  problems  of  psychological 
measurement  providing  discretion  is  exercised  in  the  selection  of  suitable 
problems  and  the  handling  methods. 




Kundu (1962) modified the scale points on a Likert scale. This
modification was a factor dividing rating method whereby the neutral point
is eliminated on theoretical grounds and the remaining scale points are not
fixed in advance by the test author, but are assigned weights by the
respondent in accordance with his prevailing response bias.

Likert  (1932)  presented  the  background  and  theory  for  his  measurement 
approaches.

Prothro (1955) found data that supported Thurstone's assumptions that the
sorting  of  items  into  an  attitude  scale  is  independent  of  the  attitude  of 
judges. 

Rozeboom & Jones (1956) stated that the degree to which scale values computed
by the method of successive intervals diverge from theoretically "true" values
is due to three types of error: error due to inequalities in
variances of the distribution from which the scale values are computed;
error due to nonnormality of the distribution; and sampling error.

Saffir (1937) made a comparison between scales constructed by the method
of paired comparison, rank order, and the method of successive intervals.
He found mutually linear scales, and concluded that all the methods he
employed produced equally valid scales. Since the three different methods
of gathering data (method of paired comparisons, order of merit method,
and method of successive intervals) and the two different psychophysical
techniques for scaling raw data (the law of comparative judgment and the
method of successive intervals) produced comparable scales, any one can be
used with considerable confidence.

Schaie  (1963)  hypothesized  that  the  concurrent  validity  of  questionnaires 
could  be  increased  by  the  use  of  item  weights  obtained  by  expert  scaling, 
instead  of  by  using  conventional  unit  weights.  The  results  showed  only 
low  magnitude  increments  in  validity. 

Seashore and Hevner (1933) modified Thurstone's method of equal appearing
intervals by having judges rate items on a nine point scale which was
printed on the left hand margin of each item, instead of sorting items
printed on separate slips into nine piles.

Siegel and Schultz (1962) demonstrated that a job related technical skills
check  list  could  be  scaled  by  both  Thurstone  and  Guttman  techniques. 

Siegel, Schultz, and Benson (1960) hypothesized that skills are scalable
in  the  same  manner  (Guttman  and  Thurstone  equal  appearing  interval  scales) 
as  attitudes  and  sensory  phenomena.  Although  their  results  supported  the 
hypothesis,  discrepant  data  raised  some  question  as  to  the  generality  of 
the  hypothesis. 

Siegel  and  Siegel  (1962)  found  that  medians  graphically  derived  and  medians 
from  sorted  judgments  scaled  by  the  method  of  equal  appearing  intervals 
correlated  .97. 



Sjoberg (1965) compared conventional scaling techniques with his
correlational scaling method using paired comparisons data. Nonlinear
relations between scales were found.

Stangenberg  (1966)  presented  definitions  of  various  scales  (the  nominal, 
ordinal,  interval,  ratio  and  logarithm)  and  discussed  them  in  terms  of 
measurement  theory. 

Stouffer, Guttman, Suchman, Lazarsfeld, Star, and Clausen (1949) defined in
their book the components of a scale and discussed the limitations of the
use of scales. Their work was a result of studies carried out with military
subjects during World War II.

Taylor and Parker (1964) found that the graphic rating scales proved as
reliable as Guttman scales, and an examination of the interscale correlations
showed that similar conclusions would be drawn from either technique.

Thurstone  (1959)  presented  the  background  and  theory  for  his  measurement 
approaches.

Torgerson  (1958)  presented  in  his  book  definitions  and  explanations  of 
scaling  methods.  The  book  includes  extensive  numerical  examples. 

Witryol (1954) experimentally compared Thurstone's Case III and Case V
and Guilford's shortcut approaches to scaling paired comparison data.
The intercorrelations between the scale values obtained by the three methods
were approximately unity.

York (1966) found that Thurstone's scale values are stable over 35 years.

Zinnes (1969) reviewed the literature on scaling. The theme of the review
was that scaling theory should be a theory of choice.
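
Several of the entries above turn on Guttman's notion of reproducibility
(Clark and Kriedt, 1948; Edwards, 1948; Guttman, 1947a). As a rough
illustration only, the sketch below computes a coefficient of reproducibility
for dichotomous items as 1 minus the proportion of responses that depart from
the ideal cumulative pattern. The error-counting rule used here (deviations
from the pattern implied by each respondent's total score) is one common
convention and is an assumption of this sketch, not necessarily the procedure
used in the studies cited. The function name and the data are hypothetical.

```python
def reproducibility(responses):
    """Coefficient of reproducibility for 0/1 item responses.

    `responses` is a list of respondents, each a list of 0/1 answers
    given in the same item order. Errors are counted as deviations from
    the ideal pattern implied by each respondent's total score.
    """
    n_people = len(responses)
    n_items = len(responses[0])

    # Order items from most to least frequently endorsed.
    totals = [sum(person[j] for person in responses) for j in range(n_items)]
    order = sorted(range(n_items), key=lambda j: -totals[j])

    errors = 0
    for person in responses:
        score = sum(person)
        # Ideal pattern: endorse only the `score` most popular items.
        ideal = [1] * score + [0] * (n_items - score)
        observed = [person[j] for j in order]
        errors += sum(o != i for o, i in zip(observed, ideal))

    return 1 - errors / (n_people * n_items)


if __name__ == "__main__":
    # Hypothetical data: a perfectly cumulative set of answers, and the
    # same set with one respondent who endorses only the least popular item.
    perfect = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]
    flawed = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 1], [0, 0, 0]]
    print(reproducibility(perfect))
    print(round(reproducibility(flawed), 3))
```

A value close to 1 indicates that total scores nearly reproduce the full
response patterns. The cutting-point procedures described in the entries
above are refinements that this sketch does not attempt.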


Chapter  V 


EFFECTS  OF  VARIATION  IN  PRESENTATION  OF 
QUESTIONNAIRE  ITEMS 


Once a decision has been made regarding the type or types of items
that are to be used in a questionnaire based on the pros and cons
discussed in Chapter III, attention must be given to the actual development
of the items. In this chapter consideration is given to articles in the
literature that investigated the effects of variations in the
presentation of questionnaire items. Sections are included on the
following: mode of items; wording of items; clarity of items; difficulty
of items; length of question stem; order of question stems; and order of
response alternatives.


Mode  of  Items 

A series  of  research  studies  were  uncovered  concerning  verbal  versus 
pictorial  presentation  of  items/stimuli  for  subject's  responses.  The 
studies  covered  a variety  of  topical  areas  and  types  of  subjects.  Table 
V-1  summarizes  the  literature  review  conducted  in  this  area. 

Four studies found no significant differences in subjects' responses
to verbal and pictorial formats (Blake, 1969; Greenberg, 1959; Jensen,
1930; and Rohila, Shanhdhar & Sharma, 1966). Only one study, relating to
a consumer preferences examination, showed statistically significant
differences attributed to mode of item presentation (Weitz, 1950). It should
be  noted  that  this  study  did  not  establish  superiority  of  one  format  over 
the  other,  but  merely  noted  differences  on  brand  ratings.  Another  study 
on  the  influence  of  communications  (Luchins  and  Luchins,  1955b)  provided  an 
important  screening  procedure  for  the  use  of  pictures  in  questionnaire 
items.  This  study  suggested  that  conformity  with  false  communications  and 
failure  to  respond  were  higher  for  ambiguous  than  clear-cut  pictures. 
Obviously,  if  pictures  are  to  be  used  they  should  be  pretested  for  clarity 
of  their  presentation  of  the  concept  or  object  to  be  evaluated. 

The overall evaluation of this area of the literature is that pictures
can be effectively employed in questionnaires. This may facilitate
obtaining survey responses from subjects with limited verbal comprehension who
might  have  difficulty  responding  to  questions  employing  lengthy  definitions 
of  concepts  or  objects. 


Wording  of  Items 


The  wording  of  question  stems  and  response  alternatives  is  a critical 
consideration  in  obtaining  valid,  reliable,  and  objective  survey  data. 

For example, Payne (1951) cited the following illustration, in which
three questions were administered to three separately matched samples
of respondents.


Summary  of  Studies  on  Mode  of  Items 


r 


#» 

1-4 

rH 

1 

0 

05 

05 

TO 

U 1 

05 

CO 

E 

■u 

X 

G 

1 

05 

3 <U 

e 

3 

o 

iJ 

T5 

CO 

GO 

a 

■u  u 

00 

05 

05 

cr 

CO 

u 

05 

M 

•H 

4J 

•H 

a 

(/} 

u 

c 

0) 

05 

05 

O 

}-l 

G 

E 

05 

TO 

C 

G 

0} 

•H  M 

o 

•H 

•rH 

•H 

05 

U-l 

o 

G 

<U 

O 

TS 

CU  0) 

u 

4J 

>. 

E 

05 

3 

u-l 

E 

U 

Cl*  4-4 

05 

<U 

C 

J= 

4J 

•H 

1-^ 

5-1 

5-1 

•H 

3 

O 

05 

05 

3 

U 

CO 

M 00 

CO 

05 

r-H 

05 

CO 

•iH 

05 

G 

0 

4-1 

O 

U 

E 

U 

CO 

O *H 

> 

♦r^ 

CO 

M-l 

CO 

,G 

CO 

05 

4-4 

•H 

C 

05 

Q) 

•H 

X 

»+4  x: 

4J 

O 

•H 

05 

4-J 

£50  X 

05 

c 

GO 

05 

05 

4J 

Xi 

O 

O 

<0 

iJ 

05 

X 

•H 

E 

GO 

-o 

1m 

•H 

(/) 

0 

rH 

>.'0 

c 

u 

•r^ 

c 

U 

4J 

T? 

-C 

3 

05 

•r^ 

C 

E 

05 

u 

0) 

Vi 

CO 

o ^ 

s-/ 

G 

1— 1 

05 

o 

•H 

C 

05 

4-5 

05 

•rH 

3 

4-1 

r-4 

1-^ 

o 

Vi 

3 

G 05 

05 

05 

05 

C5 

CO 

05 

3 

U 

•H 

G 

4-4 

TO 

3 

c 

CO 

OJ  «H 

05 

U 

U 

05 

05 

O 

•H 

05 

05 

•H 

3 

cn 

0) 

*r-i 

TJ 

cu 

0) 

0) 

>» 

C 

05 

3 

a 

c 

U 

05 

CO 

T3 

T5 

Q) 

4>4 

> 

c 

•H 

05 

J>5 

o 

3 

GO 

•r-5 

3 

a 

•r4 

<u 

o 

•r-4 

0)  o 

4J 

05 

0) 

TJ 

05 

•H 

♦H 

4J 

TJ 

G 

-C 

> 

M-t 

TD 

U iJ 

•H 

G 

}-i 

C 

05 

E 

4J 

HD 

3 

05 

O 

05 

GO 

C 

•r4 

05 

C 

O 

CO 

4J 

w 

CO 

G 

E 

U 

05 

U 

3 

TO 

•o 

•H 

4J 

•H 

05 

•r-l 

u 

4J 

o 

o 

O 

CO 

i-i 

GO 

C4 

05 

O 

U 

G 

T3 

(U 

1— ( 0 

,Q 

x: 

G 

fu 

•rV 

a 

CO 

G 

4-1 

•r4 

•«-4 

05 

u 

ffl 

CO 

t50 

CO 

G 

c 

05 

05 

05 

G 

4-5 

4-4 

4J 

44 

O 

o 

E O 

•H 

o 

rH 

•H 

O 

3 

05 

X 

1“^ 

x: 

o 

1-^ 

•H 

G 

CM 

u-l 

CO  M-l 

G 

< 

JC 

E 

o 

E 

4-5 

o 

o 

c 

'G 

TO 

c 

O 

C4 

ON 


05 

05 

GO 

GO 

05 

05  VJ 

f-5  V) 

05 

[Table body not legible in the source scan.]

(Table  continued  on  next  page) 


TABLE V-1 (cont.)

[Table body not legible in the source scan.]

1. Do you think anything should be done to make it easier for people
to pay doctor or hospital bills? (82% replied "yes")

2. Do you think anything could be done to make it easier for people
to pay doctor or hospital bills? (77% replied "yes")

3. Do you think anything might be done to make it easier for people to
pay doctor or hospital bills? (63% replied "yes")

These questions differed only in the use of the words should, could and
might, terms that are often used as synonyms even though they have
different connotations. The 19-point difference at the extremes is enough
to alter almost any survey's conclusions. This example illustrates a key
feature of much of the evidence on question wording -- it is extensively
topic-bound. Most of the studies dealing with framing questions were so broad
in scope that no single source of bias was given concentrated attention
(Belkin and Lieberman, 1967; Hubbard, 1950). It is difficult to
generalize from the literature to a specific survey situation.
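The size of this wording effect can be checked with a short calculation. The sketch below takes the 82%, 77% and 63% "yes" figures quoted above, computes the spread between the extremes, and forms a two-proportion z statistic for the "should" and "might" wordings. The sample size of 500 per wording is a hypothetical assumption; the source does not report how many respondents answered each form.

```python
import math

# Proportion answering "yes" under each wording, as quoted in the text.
yes_rate = {"should": 0.82, "could": 0.77, "might": 0.63}

# ASSUMPTION: sample sizes are not reported in the source; 500 per wording is hypothetical.
N_PER_FORM = 500


def spread(rates):
    """Difference between the highest and lowest "yes" proportion."""
    return max(rates.values()) - min(rates.values())


def two_proportion_z(p1, n1, p2, n2):
    """z statistic for the difference between two independent proportions."""
    pooled = (p1 * n1 + p2 * n2) / (n1 + n2)
    standard_error = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    return (p1 - p2) / standard_error


extreme_spread = spread(yes_rate)
z = two_proportion_z(yes_rate["should"], N_PER_FORM, yes_rate["might"], N_PER_FORM)
print(f"spread = {extreme_spread:.2f}, z = {z:.2f}")
```

Under the assumed sample size the statistic is far above the conventional 1.96 cutoff, but that conclusion depends entirely on the hypothetical n and should be recomputed with the real counts.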

The  literature  review  conducted  for  this  section  uncovered  several 
articles  and  books  purporting  to  offer  "principles  of  question  wording" 
(e.g.,  Payne,  1951;  Roslow  & Blankenship,  1939;  Blankenship,  1942).  Most 
of  the  material  presented,  however,  is  based  on  experience  rather  than 
empirical  research,  and  tends  to  be  more  prescriptive  than  positive,  more 
indicative  than  imperative. 

The stress in the discussion of the literature presented below is on
topics which have been discussed in some detail: positive versus negative
wording of items; objective versus subjective wording of items; and
definite versus indefinite article wording. A section on miscellaneous
studies on questionnaire wording has also been provided to list areas which
have been examined as "one-shot" efforts.

Positive versus negative wording of items. One topic in question
wording which has received considerable attention is statement polarity,
positively versus negatively phrased question stems. Table V-2 summarizes
the literature on this topic. It should be noted that all the studies
except Adams (1956) concern question stem wording. Only three studies
(Adams, 1956; Githens, undated; and Waters, 1966) were unable to find an
effect on study results produced by positive versus negative wording.
Eleven studies reported significant effects on a variety of measures, such
as reliability, validity, and suggestibility (Blankenship, 1940a; Burtt &
Gaskill, 1932; Campbell, Siegman & Rees, 1967; Cloud & Vaughn, 1970; Edrich,
1965; Falthzik & Jolson, 1974; Hubbard, 1950; Muscio, 1916; Rugg, 1941;
Rundquist, 1940; and Wembridge & Means, 1918). In general these studies
produced evidence that alternative positive/negative (or neutral) wordings
can produce demonstrable effects on survey results -- a conclusion not
arguing for either form of phrasing but a mere recognition that differences
in results existed.


TABLE V-2

Summary of Research on Positive Versus Negative Wording of Items

[Table body not legible in the source scan.]


Several additional similarities in results among the studies were
present. Both Burtt and Gaskill (1932) and Hubbard (1950) reported that
introducing a negative into the question form increased respondent
suggestibility. That is, there was a tendency for the direction of the
question stem to be chosen in the response alternatives. A potentially
important interviewing variable was pointed out by the comparable
findings of Falthzik and Jolson (1974) and Hubbard (1950). These two studies
illustrated a tendency for statement polarity to be more significant
when personalized (what a person says about himself) than nonpersonalized
(what he says about others or external events). Furthermore, their
results indicated that when a personalized question was changed to a
nonpersonalized version, suggestibility was decreased. Two other studies
cast doubt on the use of negatives in question stems. Muscio (1916), in
assessing the reliability of subjects reporting events they had just
observed, reported that the most reliable question form was a subjective
directed question without negatives. In a study of appropriate question
stems for voting measures, Wembridge & Means (1918) reported that
respondents took greater time and were more confused with negative and
especially double negative (e.g., "minors should not be forced not to
smoke") stems than with simple affirmative versions. These findings are
contradicted by Payne (1951), who reported from several studies that, when
people have strong convictions, the wording of the statement should not
greatly change the stand they take. Rundquist (1940) also suggested that
negative items in a series of personality measures tended to have greater
internal consistency than positively phrased items.

In conclusion, loading by statement polarity choice may be unavoidable
but can cause differences in research results. It can even be
desirable when evaluating policies or objects. But when a particular
phrasing is employed to present a distorted view of opinion or the view
which the researcher thinks is "right," it becomes an evasion of truth,
or the direct opposite of research (Payne, 1951).

Objective versus subjective wording of items. Eight studies were
uncovered relating to the effects of stating question stems in an
objective or subjective direction. A study published by Muscio (1916) is
illustrative of the research in this area. Fifty-six subjects were exposed
to a sequence of pictures and then asked if they saw certain objects in
them. The study's dependent variable was suggestibility, or the degree to
which subjects said "yes" to these objects whether they were present or
not. Muscio concluded that changing from the subjective ("Did you see a
hat in the picture?") to the objective ("Was there a hat in the picture?")
reduced suggestibility. Table V-3 summarizes additional evidence in this
area.
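Muscio's dependent variable can be made concrete with a small sketch. It scores suggestibility as the share of "yes" answers given under each question form, whether or not the object was present, following the definition above. The response records and the function name are invented for illustration; they are not data or procedures from the study.

```python
from collections import defaultdict

# Hypothetical records: (question form, object actually present?, answered "yes"?).
# These values are invented for illustration; they are not Muscio's data.
responses = [
    ("subjective", True, True),
    ("subjective", False, True),
    ("subjective", False, True),
    ("subjective", False, False),
    ("objective", True, True),
    ("objective", False, True),
    ("objective", False, False),
    ("objective", False, False),
]


def yes_rate_by_form(records):
    """Share of "yes" answers per question form, regardless of object presence."""
    yes_count = defaultdict(int)
    total = defaultdict(int)
    for form, _present, said_yes in records:
        total[form] += 1
        yes_count[form] += int(said_yes)
    return {form: yes_count[form] / total[form] for form in total}


rates = yes_rate_by_form(responses)
print(rates)
```

A lower rate for one form in real data would correspond to the reduced suggestibility the text describes; restricting the records to absent objects would give a stricter, false-report version of the same measure.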


Muscio's evidence regarding objective-subjective direction and
suggestibility has been supported by empirical studies conducted by
Blankenship (1940a), Dohrenwend (1965), and Hubbard (1950). The only
conflicting evidence uncovered in the area was presented by Burtt and
Gaskill (1932), who reported that the objective form showed greater
suggestibility.


TABLE V-3

Summary of Research on Objective Versus Subjective Wording of Items

[Table body not legible in the source scan. One row survives from the
continuation page:]

Fiske (1969). Personality inventories; first person vs. third person
wording; 300 airmen. Significant differences between wordings: "What would
others say about you" had higher scale values than self-description.



Concerning response specificity, or the avoidance of the "Don't know"
category, three studies with conflicting results were found. Dohrenwend
(1965) noted that the objective version had higher response specificity,
while Burtt and Gaskill (1932) and Blankenship (1940a) concluded that
"don't knows" increased with the objective direction.

The reliability of objectively or subjectively phrased questions
has also been investigated. The only evidence of higher
reliability for objective question versions was presented by Blankenship
(1940a). Hubbard (1950) and North and Schmid (1960) presented evidence
that subjectively phrased items were more reliable.
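Comparisons of this kind depend on how reliability is scored. The sketch below assumes test-retest reliability, computed as the Pearson correlation between two administrations of the same item to the same respondents; this is one common operationalization, and the studies cited may have used other measures. The scores are invented for illustration.

```python
import math


def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    covariance = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    variance_x = sum((a - mean_x) ** 2 for a in x)
    variance_y = sum((b - mean_y) ** 2 for b in y)
    return covariance / math.sqrt(variance_x * variance_y)


# Hypothetical scores for five respondents on the same item, asked twice.
first_administration = [1, 2, 3, 4, 5]
second_administration = [1, 2, 3, 5, 4]

reliability = pearson_r(first_administration, second_administration)
print(f"test-retest reliability = {reliability:.2f}")
```

Computing this coefficient separately for the objective and subjective versions of an item would give the two figures being compared.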

The only research studies reporting validity evidence have been
conducted by Blankenship (1940a, 1940c). In both studies the objectively
stated version of a question had higher predictive accuracy.

It  seems  that  follow-up  research  is  warranted  in  this  area.  The 
limited  research  evidence  points  up  more  contradictions  than  similarities 
in  findings.  The  only  area  where  a tentative  conclusion  favoring  objective 
over  subjective  phrasing  can  be  made  is  in  the  area  of  suggestibility. 

Definite versus indefinite article wording. Two studies were found
which reached similar conclusions regarding the use of definite or
indefinite articles in question stems. Indefinite article ("a" or "an")
items are exemplified by the following type of question -- "Did you see a
demonstration of the new night vision device?" A definite article ("the")
item would be worded -- "Did you see the demonstration of the new night
vision device?" Studies by Muscio (1916) and Hubbard (1950) both concluded
that changing from "a" to "the" wordings reduced the level of suggestibility.
The use of indefinite article questions, however, led to increased
reliability of answers when factual or objective information was sought
(Hubbard, 1950). No conclusions in this area can be drawn because of the
limited evidence available.

Miscellaneous  studies  on  question  wording.  The  previous  three  areas, 
positive  versus  negative,  objective  versus  subjective,  and  definite  versus 
indefinite  article  wordings,  have  been  researched  in  a somewhat  systematic 
fashion.  This  section,  however,  is  designed  to  present  selected  highly 
relevant  studies  about  question  wording  which  have  not  been  replicated  by 
other  social  scientists. 

Several isolated studies have dealt with the effect of building into
the question some reference to prominent people. For instance, Cantril
(1940a) compared responses to the following two questions: "Do you approve
of President Roosevelt's sending Sumner Welles to visit European capitals?"
and "Do you approve Sumner Welles's visit to European capitals?" When
Roosevelt's name was used, more people had opinions and more people
disapproved (25% versus 31%). In other studies of the suspected "big name
effect" the results have varied, the big name sometimes making a difference
and sometimes not (Belson, undated b).




Research has also been conducted on the consequences of employing
stereotypes, emotion-charged words, or culturally biased words. Significant
changes in the frequencies of positive, negative, or don't know responses
may result (Roslow, Wulfeck & Corby, 1940).

The potpourri of scattered studies on question wording is illustrated
by the following:

1. Hubbard (1950) reported that the incomplete disjunction form, e.g.,
"Was the demonstration interesting, dull or just so-so?" possessed
relatively high suggestibility and relatively low reliability.

2. North and Schmid (1960) examined all possible combinations of
personal-impersonal and qualified-unqualified forms of questions,
and concluded that personal-qualified versions may be the best
according to internal and test-retest reliability and independence
criteria.

3.  Steele  (1964)  compared  projective,  direct,  and  indirect  questions. 
Information  from  the  projective  item  (interpretations  of  a picture 
drawing)  explained  more  variance  in  the  dependent  variable  (milk 
consumption)  than  direct  and  indirect  questions. 

4. Waters (1966) presented evidence that a subject's reaction to a
forced choice scale was more favorable when some method was
incorporated whereby the subject was given the opportunity to
indicate the degree of applicability of each item to himself.
"Most descriptive of you" and "least descriptive of you" were not
as effective as "the degree to which this applies to you" on a
five-point scale.

5. Thumin (1962) experimentally examined buffer items in question
sequencing. Buffer items were defined as neutral items intended
to establish rapport which were placed before "delicate" items.
Study results indicated that the buffer items increased
respondents' admissions of insomnia.

6. In a study involving return of a job satisfaction questionnaire
by 47% of over 1,000 life insurance agents, the effectiveness of
direct and indirect questioning techniques was examined (Weitz
and Nuckols, 1953). Results indicated that the direct and indirect
items intercorrelated significantly, but that the direct items in
general had greater validity (were better predictors of job
survival).

7. Richardson (1960) tested the widespread assumption that
interviewers should not use leading questions. Tape recorded interviews of
seven experienced and 30 untrained interviewers were compared.
Questions were classified into leading and nonleading questions.
A leading question was operationally defined as one which includes,
either explicitly or implicitly, the answer which the interviewer
expected to receive. Study results showed that, contrary to
expectation, experienced interviewers used leading questions in
33% of all their questions; and leading questions elicited no
more responses containing distorted information than did
nonleading questions.

Conclusions. The literature discussed in the area of question wording
illustrates certain gaps and thinnesses. Many important issues have
been raised by these studies, but there has been little systematic pursuit
of the issues to a conclusion. Seldom have replication studies been
uncovered or attempts made to carry them out in different settings. Many
of the studies have been carried out with small samples or only with
college student subjects. Finally, no research studies were uncovered
which examined the wording of response alternatives.


Clarity  of  Items 

Almost every check list of prescriptives for writing question items
includes instructions such as: "make sure your questions are clear to
the respondent;" "avoid ambiguous, vague, and imprecise questions;" and
"questions must be concrete and specific." Unfortunately, there has been
little systematic research on measures of clarity or the effects of
unclear questions on subjects' responses. In fact, most of the available
literature is based upon authors' experiences, intuition, and "common"
sense (e.g., Jenkins, 1941; Roslow & Blankenship, 1939). This issue has
received some investigation in the area of interviewer ambiguity in
phrasing questions (Hanson & Marks, 1958), a topic not covered here.

This section will present the limited evidence available on how to
improve item clarity and the effects of item clarity on subjects'
responses.

Studies on improving item clarity. Several interesting studies have
been reported which suggested diverse tactics of improving item clarity.
Gray (1955) suggested that in framing questions which depend on
respondents' memory or recall capabilities, the time period a question covers
must be carefully defined and redefined. The when should be specifically
provided. Litwak (1956) suggested that ad hoc rules on question wording,
such as cautions against loaded, vague, double-barreled questions, can be
investigated by latent structure analysis. Evidence was presented that
bias in questions may lie in too many (ambiguity), too few (clarity), or
inappropriate (clarity) dimensions. Thus, alternative question wording
and additional descriptions may aid a subject's interpretation. Toops
(1937) reported that subjects preferred a format where key words or
portions of the question stem were capitalized. The inference was that
an idea of what is required to respond to the question and overall clarity
can be obtained by a glance at the capitalized material. No results were
reported concerning whether underlining key words might accomplish the
same goal.



Several  studies  were  found  which  are  attempts  to  isolate  the  amount 
of  clarity  in  questions.  Speak  (1967)  conducted  a study  whereby  subjects 
who  had  responded  to  questions  in  a personal  interview  were  reinterviewed 
the  following  day  by  another  "in-depth"  session  to  ascertain  what  the 
respondent  had  "really"  meant  and  how  he  interpreted  the  questions.  It 
was  found  that  not  one  question  was  perceived  by  every  subject  as  intended, 
nor  did  one  subject  perceive  all  the  questions  as  intended.  It  appears 
then  that  follow-up  interviews  might  be  purposeful  in  screening  paper  and 
pencil  question  items.  For  example,  Nuckols  (1953)  submitted  poll  questions 
to  respondents  and  then,  after  completion,  asked  them  to  interpret  the 
meaning of the questions. At least 17% of these interpretations were judged
to  be  wholly  or  partially  wrong. 

Another clarity screening method has been offered by Norman (1963b).
He conducted a study of test item content in personality measurement. The
results  indicated  that  there  existed  marked  differences  in  the  validities 
obtainable  from  different  classes  of  test  stimuli,  those  with  the  highest 
degree  of  judged  content  relevance  producing  the  most  satisfactory  results. 
To  the  degree  that  relevance  enhances  the  clarity  of  questions,  this  would 
also  seem  to  be  an  appropriate  pretesting  procedure. 

A technique called the "random probe" was used to check what closed
questions actually meant to respondents in a survey in Pakistan (Schuman,
1966). Interviewers were instructed to select 10 items at random for
further probing. Respondents' understanding was then ranked on a
five-point scale. Results indicated that with this particular instrument a
significant minority of the respondents had real difficulty with the
questions.
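
The random probe procedure can be sketched in a few lines of Python. This
is only an illustration under stated assumptions: the 200-item instrument,
the function names, the direction of the five-point scale (1 = no
understanding, 5 = full understanding), and the cutoff are all
hypothetical and are not taken from Schuman (1966).

```python
import random


def select_probe_items(item_ids, n_probes=10, seed=None):
    """Pick n_probes distinct items at random for follow-up probing."""
    rng = random.Random(seed)
    return sorted(rng.sample(list(item_ids), n_probes))


def difficulty_rate(ratings, cutoff=3):
    """Share of probed answers rated below `cutoff` on an assumed
    1-5 understanding scale (1 = no understanding, 5 = full)."""
    poor = sum(1 for r in ratings if r < cutoff)
    return poor / len(ratings)


# Hypothetical instrument with items numbered 1..200.
probe_items = select_probe_items(range(1, 201), n_probes=10, seed=7)

# Hypothetical understanding ratings assigned to five probed answers.
example_rate = difficulty_rate([1, 2, 3, 4, 5])
```

Drawing a fresh random subset for each interview spreads the probing
across all items without lengthening any single interview very much.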

Miklich (1966) studied response sets in relation to ambiguously
worded statements. Forty-two subjects were given statements with four
types of treatment: ambiguous, unambiguous, important, and unimportant.
They were asked whether they agreed or disagreed. The analysis indicated
that ambiguous items did result in more agreement-disagreement response
set. That is, if the ambiguous item was important (not defined in the
study writeup) the tendency was to agree with it, while if unimportant,
the tendency was to disagree.

A large scale study (Belson, undated c) of respondent understanding
of over 2,000 items used in market and social survey questions included
a content analysis of reinterview data with 265 subjects regarding their
understanding of the original questions. Findings related to item clarity
were: if a broad term or concept was used in a question, there was a
strong tendency for respondents to interpret it less broadly; and
respondents who failed to hear some part of a question tended to
reconstruct the question from what they had heard.

Effect of item clarity on subjects' responses. Few studies have been
conducted in this area, perhaps because question clarity is itself such a
vague, general concept. One important study was offered by Armstrong and
Overton (1971). Two versions of a questionnaire about intentions to use
a new transportation service were tested. One version using a brief
description and one using a comprehensive description were successively
administered. No significant differences were found on estimates of
level of demand at various prices, or on the identity of likely user
groups. Thus, in some cases, additional verbal material in questions or
topic descriptions may not alter subjects' responses.


Difficulty  of  Items 

One  of  the  first  "laws"  of  questionnaire  development  advanced  by 
almost  every  general  source  on  how  to  write  sound  questionnaires  is  the 
statement  "keep  it  simple."  Logic  dictates  that  words  used  in  surveys 
should  not  have  multiple  meanings,  nor  should  they  be  beyond  the  level  of 
vocabulary  of  the  typical  respondent.  Unfortunately,  this  advice  is  often 
poorly  operationalized. 

This  section  discusses  measures  of  item  difficulty,  and  miscellaneous 
studies  of  survey  instruments.  The  abstracted  literature  on  item  diffi- 
culty is  summarized  in  Table  V-4. 

Measures  of  item  difficulty.  A series  of  studies  have  taken  standard- 
ized tests  or  published  public  opinion  poll  questions  and  subjected  them 
to  a form  of  content  analysis  against  reading  or  vocabulary  difficulty  in- 
dices. Payne (1950a)  found  that  "tightly  worded"  questions  on  an  opinion 
poll  had  Flesch  scores  at  7th  or  8th  grade  level  whereas  "loose"  questions 
with  large  variance  in  reverse  worded  items  scored  at  the  high  school 
level  or  above. 
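
As an illustration of how a readability index can be applied to a single
question, the sketch below computes the Flesch Reading Ease score
(206.835 - 1.015 x words per sentence - 84.6 x syllables per word). It is
a minimal sketch, not the procedure used by Payne: the syllable counter is
a rough heuristic assumed here for convenience, and the two example
questions are hypothetical. Higher scores indicate easier reading.

```python
import re


def count_syllables(word):
    """Rough syllable estimate (an assumed heuristic): count vowel
    groups and drop a trailing silent 'e'."""
    word = word.lower()
    count = len(re.findall(r"[aeiouy]+", word))
    if word.endswith("e") and not word.endswith("le") and count > 1:
        count -= 1
    return max(count, 1)


def flesch_reading_ease(text):
    """Flesch Reading Ease score for a short passage of English text."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    syllables = sum(count_syllables(w) for w in words)
    words_per_sentence = len(words) / len(sentences)
    syllables_per_word = syllables / len(words)
    return 206.835 - 1.015 * words_per_sentence - 84.6 * syllables_per_word


# Hypothetical question wordings, one plain and one heavily worded.
simple_q = "Do you like your job?"
complex_q = ("To what extent do organizational communication inadequacies "
             "negatively influence your occupational satisfaction?")

simple_score = flesch_reading_ease(simple_q)
complex_score = flesch_reading_ease(complex_q)
```

Scoring every item in a draft instrument this way gives a quick screen for
questions that may exceed the reading level of the intended respondents,
although such a heuristic is no substitute for a pretest.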

Similarly,  Nuckols  (1953)  reported  that  nine  published  poll  questions 
had  remarkable  problems  in  wording  difficulty.  In  an  independent  retest, 
17%  of  his  subjects  had  interpretations  of  individual  questions  which  were 
judged  partially  or  wholly  wrong.  Flesch  scores  ranged  from  5.8  to  17.2 
in  reading  grade.  Another  study  (Terris,  1949),  again  a reexamination  of 
poll  questions,  compared  Flesch  and  Dale-Chall  readability  scores  to  Census 
Bureau  Reports  on  formal  school  levels  of  the  U.S.  population.  Study 
results indicated that:

1. 91.6% of all the questions were above the comprehension level of
   12.4% of the population.

2. 73.4% of all the questions were above the comprehension level of
   23.2% of the population.

3. 9.8% of all the questions were above the comprehension level of
   72.6% of the population.

Difficulty of items has also been assessed with The Teacher's Word
Book of 30,000 Words (Thorndike and Lorge, 1944). Users of this source
state that it is best to err on the side of simplicity if doubt exists.


Table V-4. Summary of the Literature on Item Difficulty (table body not
legible in this copy)


There  are  many  examples  of  misunderstandings  of  what  seem  to  be  everyday 
words. One study (Roeber, 1948) found that 10% to 20% of the vocabulary
used  in  seven  of  the  most  popular  interest  inventories  was  above  the  9th 
grade  reading  level  as  measured  by  the  Thorndike  and  Lorge  word  list. 

Unfortunately,  no  studies  were  found  concerning  the  comparative  re- 
sults of  reading  level  measures  using  scores  on  the  Flesch,  Dale-Chall, 
Thorndike  and  Lorge,  or  other  readability  scales.  Also,  no  information 
was  uncovered  regarding  the  "fog  level"  reading  difficulty  scoring  system 
used  by  the  U.  S.  Air  Force,  or  Fry's  Readability  Graph  (Fry,  1968). 
However,  a detailed  literature  search  in  these  areas  was  outside  the 
scope  of  this  review. 

Hanley  (1965)  suggested  two  other  measures  of  item  difficulty  with 
reference  to  personality  testing;  response  latency  and  subjective  confi- 
dence in  accuracy  of  answer.  Either  of  these  measures  might  also  be  em- 
ployed in  pilot  or  pretest  studies  of  survey  instruments. 

Miscellaneous studies of survey instruments. More attention has
evidently been devoted to measures of item difficulty than to the effects
of item difficulty on questionnaire responses. Hanley (1965) and
Stricker (1963) offer the exceptions. Both have examined the impact of
item difficulty on acquiescence response bias, but with conflicting
conclusions. Stricker determined that acquiescence was more prevalent with
moderate or hard-to-read attitude items, but found the opposite
relationship for personality items. Using the response latency and
subjective confidence difficulty measures, Hanley (1965) concluded that
acquiescence occurred with difficult, rather than easy, personality
inventory material. Additional attention obviously needs to be focused on
item difficulty in terms of response tendencies such as acquiescence,
"don't know," and "no" responses.

One study (Faerber, 1951) has addressed the important matter of
comparative difficulty of different response alternative formats. In a timed
arithmetic  test,  open  answer,  right-wrong,  multiple  choice,  and  multiple 
choice  with  separate  answer  sheet  formats  were  experimentally  manipulated. 
Results  showed  increasing  difficulty  in  the  order  listed  here. 

Finally, Myers (1962) compared homogeneous to heterogeneous item
difficulty educational tests. No differences in validity coefficients
were found, but tests homogeneous in difficulty were shown to be more
reliable.


Length  of  Question  Stem 


Only  three  studies  were  found  in  the  literature  search  effort  which 
dealt  with  length  of  question  stems.  It  should  be  noted,  however,  that 
the  topic  of  instrument  length  is  discussed  in  Chapter  VIII. 




The first study located (Brinkmeier, 1930) was only marginally
relevant to questionnaire construction because it concerned true-false
tests for high school student examinations. A statistical analysis of
6,671 question stems submitted by high school teachers in a national
contest revealed that stems under twenty words in length were as often
true as false. As length increased beyond twenty words, however, the
probability that the answer was true increased.

Marquis, Cannell, & Laurent (1972) examined the impact of question
length and respondent education on self-reports of health information.
The data were later compared to physicians' reports of the same
information for each respondent. Results indicated that longer (interview)
questions  increased  the  accuracy  of  reports  from  those  who  had  finished 
high  school,  and  had  the  opposite  effect  on  those  who  had  not.  Another 
study  (Laurent,  1972),  perhaps  reporting  on  the  same  data  base  as  the  pre- 
vious citation,  offered  evidence  drawn  from  four  experiments  conducted 
with  samples  ranging  from  24  to  200  interviews  for  the  U.  S.  Public  Health 
Service.  Questions  were  altered  in  length  by  adding  redundant,  inconse- 
quential information  in  various  treatments.  It  was  found  that  the  longer 
questions  elicited  more  information  than  the  shorter  questions.  Also, 
after  checking  with  physicians'  reports,  it  was  found  the  longer  questions 
received  more  accurate  answers. 

This  is  apparently  an  underresearched  area.  The  few  isolated  studies 
just  reviewed  concern  only  objective  tests  and  interview  schedule  develop- 
ment. The  conclusion  that  longer  question  stems  (controlling  for  age) 
produce a greater amount of, and more accurate, information cannot be
generalized based upon the limited evidence. More research is warranted
in this area.


Order  of  Question  Stems 

Several different sources of error must be considered regarding the
general issue of question stem effects in questionnaire methodology.
Order bias has several meanings in survey research concerning question
stems. For example, if the question were asked, "Which kind of weapon do
you prefer, the M14 or the M16," one might conjecture that reversing
the order of alternatives within the question stem might be a source of
response error. Literature in this area is discussed in the section of
this chapter on the Wording of Items. Order bias in this section refers
to the order of questions within a series of items designed to explore
the same subject matter, or related subject matter areas. A related issue
concerns the position effect problem: the order of different groups of
questions, when the groups deal with essentially unrelated subject matter
areas.

Table  V-5  presents  a summarization  of  literature  dealing  with  the 
order  bias  and  position  effect  problems. 


Table V-5. Summary of Studies Relating to the Order of Question Stems
(table body not legible in this copy)


Order bias. One of the most typical caveats discussed in the general
literature on how to construct questionnaires is the statement, "vary
(randomly assign) the order of questions on an instrument to avoid one
question contaminating another." Especially prominent are the discussions
of cases where the immediately preceding question or group of questions
places the respondent in a different "mental set" or frame of reference.
For example, asking respondents a general question about their feelings
about automobile exhaust pollution might influence responses to a question
like: "Do you prefer leaded or nonleaded gasoline?" Although this effect
may be prominent in specific, applied settings, little evidence was found
in the literature supporting a general order bias phenomenon in survey
research. Five studies (Baehr, 1953; Blumberg, DeSoto & Kuethe, 1966;
Brenner, 1964; Ferber, 1966; and Lyman, 1949) were unable to document order
biases in investigations in divergent topical areas. Five studies (Cohen,
1965; Gross, 1964; Hofstee, 1966; O'Dell, 1962; and Survey Research Centre,
1972) found that the presentation order of question stems significantly
affected response distributions to given items, nonresponse to items, and
preferences for specific stimuli. Thus, the findings in this area are
inconclusive; no support exists for the presence of a general order
effect in questionnaire responses. Although the literature that was
reviewed on this topic was sparse, it appears that order effects are a
function of the specific instrument and subjects employed in the
investigation. It is interesting to note that order bias or question
sequence may be a subtle issue in specific cases. More experiments testing
the effect of changing the sequence of questions have been uncovered that
show no effect than show significant differences.
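
Whether question order shifted the response distribution on a given item
can be checked with a Pearson chi-square test on the answers obtained
under each ordering. The sketch below is illustrative only: the counts are
hypothetical, the function name is an assumption, and none of the studies
cited above is claimed to have used this exact analysis.

```python
def chi_square(table):
    """Pearson chi-square statistic and degrees of freedom for an
    r x c table of observed counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return stat, df


# Hypothetical counts. Rows: questionnaire form (question order A, order B).
# Columns: answers to the target item (favorable, neutral, unfavorable).
counts = [[60, 25, 15],
          [45, 30, 25]]

stat, df = chi_square(counts)

# With 2 degrees of freedom the .05 critical value is 5.991; a statistic
# above that value would suggest that order changed the distribution.
order_effect_detected = stat > 5.991
```

A test of this kind addresses one item on one instrument, which is
consistent with the observation above that order effects appear to depend
on the specific instrument and subjects.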

Position  effect.  Practical  advice  on  how  to  avoid  position  bias 
problems  abounds  in  the  questionnaire  development  literature.  Suggestions 
to  phrase  questions  in  a logical  sequence,  build  rapport  first,  ask  for 
the  basic  information  sought  next,  and  personal  questions  last,  are  illus- 
trative of  the  guidelines  offered  the  questionnaire  designer.  From  the 
literature  review,  it  appears  that  the  extent  of  a general  position  bias 
is  unknown.  This  is  an  area  that  is  poorly  documented.  Four  studies 
(Bradburn  & Mason,  1964;  Cohen,  1965;  Lyman,  1949;  and  Metzner  & Mann, 
1953)  were  unable  to  find  any  effect  of  changing  the  sequence  in  which 
major  sections  of  questionnaires  were  presented.  Conflicting  evidence 
was  offered  by  experimental  results  presented  by  Landon  (1971)  and  the 
Survey  Research  Centre  (1970).  Again,  it  must  be  concluded  that  systema- 
tic research  in  this  area  is  lacking.  As  in  the  previous  case,  however, 
position  bias  may  be  operative  in  specific  research  situations,  but  the 
weight  of  the  evidence  supports  a negligible  influence  of  position  bias 
on  survey  findings. 

Conclusions. The results in the areas of order bias and position
effect cannot be regarded as definitive. In light of the unknown in these
areas, individual questions and question sections can probably be placed
into whatever appears to be the best psychological or most logical order.


Order of Response Alternatives

One  of  the  principles  of  questionnaire  development  advanced  by  psy- 
chologists is  that  the  responses  to  a particular  proposition  will  be 
influenced  by  the  position  of  the  alternative  in  the  question.  In  the 
literature  of  questionnaire  methodology,  it  is  also  known  as  the  "time 
error"  and  can  occur  in  questionnaire  applications  as  well  as  with  labora- 
tory methods.  Mathews  (1929),  in  one  of  the  earlier  works  to  recognize 
this response pattern, noted in reviewing the results of an experimental
study  that,  although  overall  differences  in  sequencing  existed,  the  first 
of  two  alternatives  in  a question  where  the  order  was  varied  received  more 
endorsements  than  the  second  position.  This  study  also  suggested  that  the 
fourth  (of  five)  response  alternatives  was  chosen  somewhat  more  frequently. 
Mathews'  work  has  received  only  token  empirical  support  with  respect  to 
other  reviewed  literature.  Belson  (1965)  and  Winthrop  (1958)  offered 
evidence  that  reversal  of  verbal  or  numeric  rating  scale  response  alterna- 
tives are  coupled  with  a significant  shift  in  endorsements  toward  the  first 
presented  end  items  or  anchors.  Belson  (1965)  reported  that  a reversal 
from  positive  to  negative  scale  orders  resulted  in  a greater  proportion  of 
choices of negative (or unfavorable) end categories. Winthrop's evidence
suggests, similarly, that reversal of numerical preference alternatives in
natural numbers order (e.g., 1, 2, ..., 5) results in lower scale
reliability.

Two  additional  studies  documenting  an  order  effect  in  response  alter- 
natives were  found.  Becker  (1964)  reported  that  subjects'  choices  of 
their  five  favorite  types  of  radio  and  T.V.  programming  were  influenced 
by  the  ordinal  position  of  the  choices  in  a checklist.  This  study  suggests 
that,  as  an  item  is  listed  close  to  the  end  of  a checklist,  the  probability 
of its selection is reduced. Madden and Bourdon (1963) found that reversing
the order of levels of job factors that were presented to airmen for
evaluation of various jobs resulted in significant differences in job
ratings.

The  studies  discussed  above  must  be  regarded  as  the  exceptions  rather 
than the rule in this research area. Seven experimental studies (Blumberg,
DeSoto & Kuethe, 1966; Campbell & Mohr, 1950; Clark, 1956; Dyer, Klein, &
Yudowitch, 1975; Feldman, 1969; Kane, 1971; and Symonds, 1936) reported
little or no order effects with response alternatives. The first study,
for  example,  experimentally  manipulated  the  "good"  end  of  a graphic  rating 
scale  in  left,  right,  top,  or  bottom  positions  with  minimal  resultant 
effect  on  ratings.  The  analyses  conducted  by  Dyer,  Klein,  and  Yudowitch 
(1975)  concerned  a VOLAR  study  administered  at  Fort  Hood,  Texas,  with 
over 500 military subjects. Reversal of response alternatives was
accomplished by presenting one-half of the subjects with alternatives listed
from most positive to most negative; e.g., "The training I have received
at Fort Hood has been: very challenging, challenging, borderline,
unchallenging, very unchallenging." The remaining subjects received
response alternatives listed from most negative to most positive. This
treatment, used on both attitude and satisfaction scales in the VOLAR
questionnaire, did not produce significant differences on either individual
items or categories of items.




Several problems uncovered by the literature reviewed for this section
preclude arriving at any valid generalization concerning order effects in
response alternatives. First, most of the available published studies
have been conducted with relatively small samples of college student
subjects. Second, the number of studies conducted in this area is limited.
Third, no systematic research has been published with respect to the order
of response alternatives in specific types of rating or scaling devices,
such as graphic or verbal ratings, semantic differential, or Likert scales.
Fourth, important moderating variables such as subjects' characteristics,
topical area, scale length (number of response alternatives), and
instrument length have neither been controlled nor built in as experimental
treatments.

The reviewed studies on the order of response alternatives are
summarized in Table V-6. Because of the inconclusive nature of the findings
and their contradictions, care probably should be taken to alternate the
order of response alternatives when it appears appropriate to do so. In
this vein, it should be noted that several authors (Ross, 1934; Mosier &
Price, 1945) have developed tables to standardize the order of
presentation of words. Although these tables were designed to provide
systematic variation of paired comparison and multiple choice items, they
may be applicable to verbal rating scales as well. Ross (1934) states that
his method will aid in avoiding "regular" repetition patterns, providing
optimum spacing between identical words, and balancing out fatigue effects.
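
The reversal treatment described for the VOLAR study (half of the
subjects see the alternatives from most positive to most negative, the
other half in the reverse order) can be sketched in a few lines of Python.
This is only an illustration under stated assumptions: the function names
assign_orders and score, the random split, and the 5-to-1 scoring are
hypothetical and are not taken from any of the reviewed studies.

```python
import random

# Response alternatives quoted in the text, most positive first.
ALTERNATIVES = ["very challenging", "challenging", "borderline",
                "unchallenging", "very unchallenging"]


def assign_orders(respondent_ids, alternatives, seed=0):
    """Randomly give half of the respondents the forward order
    and the other half the reversed order (assumed design)."""
    ids = list(respondent_ids)
    rng = random.Random(seed)
    rng.shuffle(ids)
    half = len(ids) // 2
    forward = list(alternatives)
    backward = forward[::-1]
    return {rid: (forward if i < half else backward)
            for i, rid in enumerate(ids)}


def score(choice, alternatives=ALTERNATIVES):
    """Score the chosen label on the canonical scale, independent of
    the order in which it was presented (5 = most positive)."""
    return len(alternatives) - alternatives.index(choice)


orders = assign_orders(range(10), ALTERNATIVES)
n_forward = sum(1 for order in orders.values() if order[0] == "very challenging")
```

Scoring by label rather than by printed position keeps the two halves of
the sample comparable, so that any difference between them can be
attributed to presentation order.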




Table V-6. Summary of Studies on Order of Response Alternatives
(The body of this table is not legible in the source copy.)


Chapter  VI 




NUMBER  OF  RESPONSE  ALTERNATIVES  AND 
RESPONSE  ANCHORING 


The effects of variation in the presentation of questionnaire items,
including the order of response alternatives, were discussed in Chapter III.
This chapter considers two related topics: the number of response
alternatives to employ, and response anchoring.

Issues  Regarding  Number  of  Response  Alternatives  to  Employ 

One  of  the  basic  issues  in  the  use  of  any  given  rating  instrument  or 
attitude  scaling  device  is  the  determination  of  the  optimum  number  of 
response alternatives or categories. Researchers' habit or tradition,
rather than solid empirical support, often has led to the recurrent use
of  five  point  Likert  scales,  seven  point  semantic  differential  scales,  and 
so  on.  The  reason  for  concern  with  the  number  of  response  alternatives 
stems  from  the  belief  that  a "coarse"  scale  with  too  few  response  alterna- 
tives may  result  in  a loss  of  information  concerning  subjects'  discrimina- 
tion powers,  or  reduced  cooperation  in  rating  reflecting  a dislike  for 
"forcing"  a judgment.  An  extremely  "fine"  scale,  with  too  many  response 
alternatives,  may  go  beyond  the  subjects'  powers  of  discrimination,  be 
excessively  time  consuming,  or  difficult  to  score. 

The literature search in the area of number of response alternatives
was very productive. Over 30 studies were found which were directly
related to this issue. Table VI-1 summarizes the literature. The final
three columns, headed Reliability, Validity, and Other Findings, illustrate
that multiple criteria have been used in investigating the issue of
the optimum number of response alternatives to employ. The major criteria
used have been reliability, validity, factors influencing subjects'
motivation and ability to respond, and scoring and data analysis
considerations. Each of these criteria will be discussed below.

Reliability. Numerous studies (Bendig, 1953; Bendig, 1954a; Komorita &
Graham, 1965; Jacoby & Matell, 1971; Masters, 1973; Matell & Jacoby, 1971;
Saunders & Ward, 1964) in the psychometric literature have shown that
increasing the number of response alternatives does not necessarily
increase reliability. These empirical efforts have employed a wide variety
of response alternative combinations as experimental treatments, for
example, two through 19 alternatives inclusive, and were conducted using
several types of rating scales in the context of numerous topical areas.
It should be noted that all the studies above except the two Jacoby and
Matell efforts were concerned with internal consistency measures, that is,
equivalent forms or split-half reliability. Jacoby and Matell (1971)
examined both internal and temporal (test-retest) consistency and concluded
that both measures were independent of the number of response alternatives.


Table VI-1 (continued). Studies relating to the number of response
alternatives. Only one row of this page of the table is legible in the
source copy:

Saunders & Ward (1964): 2 vs. multiple choice alternatives; bipolar
scales/personality; 282/college; reliability: no difference in
reliability; other findings: efficiency, no difference in the proportion
of positive responses.



Several studies in the literature indicate that a nonlinear
relationship exists between the range of response alternatives and the
magnitude of the coefficient of reliability. For example, in separate
investigations Bendig (1953, 1954a, 1954c) found: reliability was equal
for three, five, seven, and nine point scales, but lower for
11 alternatives; rater reliability (summed ratings for each object rated)
was constant from five to nine alternatives but slightly higher at three
and slightly lower at two categories; and rater reliability was highest
with four alternatives and lowest with two alternatives. Neidt and Merrill
(1951) reported that the reliability of a five point rating scale was
slightly higher than that of a two alternative, positive-negative format.
In a personality assessment study, Symonds (1924) contended that fewer
than seven scale alternatives resulted in a loss of reliability, but
employing more than seven did not improve reliability.

The above studies seem to suggest that there is an optimal number of
response alternatives to employ for any given investigation situation,
including the topic area, characteristics of the subjects, etc. For
example, in other studies results were dependent on the type of rating
instrument used. Komorita and Graham (1965) found that increasing the
number of response alternatives improved reliability for heterogeneous
scales with dissimilar item content, but had no effect on homogeneous
scales. Masters (1973) demonstrated that the reliability of a
traditionalism of education scale was independent of the number of
response alternatives, but in a progressivism scale reliability increased
from two to three alternatives.

Validity. Jacoby and Matell (1971) pointed out that most of the
psychometric literature dealing with the number-of-alternatives issue
emphasized reliability as the major, and in some cases the only, criterion
in the choice of the number of scale points. They felt, however, that
the ultimate criterion is the effect a change in the number of scale points
has on the validity of the scale. Table VI-1 illustrates that only two
original studies addressed the validity question. Neidt & Merrill (1951),
in an attitude toward education investigation, reported no difference in
concurrent validity between two and five alternative rating scales. This
study reported mean course marks and scale scores, holding constant ACE
scores and hours studied per week.

The  only  study  examining  both  concurrent  validity,  with  attitudes 
and  behavior  measured  at  one  point  in  time,  and  predictive  validity  (cor- 
relation of  observed  behavior  with  that  which  was  predicted  from  attitude 
measures)  was  Jacoby  and  Matell  (1971).  In  relation  to  both  measures, 
the  authors  concluded  that  no  consistent  relationship  existed  between 
either  measure  and  number  of  response  alternatives  employed. 

Although the evidence is consistent, the lack of numerous studies
using divergent types of subjects, instruments, and topics makes it
difficult to reach a conclusion regarding the effect of the number of
response alternatives on validity.




Factors influencing subjects' motivation to respond and efficiency
of response. This section addresses a series of related matters:
subjects' preferences for and ability to use scales with a varied number
of response alternatives. Direct and indirect measures of subject
preferences and motivation have been examined. Direct measures, preference
ratings, were used in studies conducted by Matell (1970) and Strahan
(1971). However, both studies used college students as subjects.
Experimental results were consistent: college students reported a
preference for using finer scales. In Matell's investigation, scales of
nine to thirteen alternatives were preferred, and Strahan reported
significantly higher preferences for using "several" alternatives over a
true-false format.

Indirect or proxy measures of subject preferences and motivation
include response time and number of "uncertain" and "no responses." Matell
(1970) presented evidence that no difference in total time for
administration was shown for two through nineteen alternatives. Bevan and
Avant (1968) and Matell and Jacoby (1972), however, reported that testing
time increased as a direct function of the number of alternatives, thus
supporting the more intuitively plausible findings. Concerning the
relationship between the use of "uncertain" response categories or "no
responses" and scale length, the literature supported the conclusion that
increasing the number of response alternatives decreased uncertain and
non-responses to scale items (Matell, 1970; Hughes, 1969; Matell & Jacoby,
1972; Ghiselli, 1939; Dunette, Alyward & Uphoff, 1956; Tsudzuki, 1953;
Zuckerman, 1952). For example, Tsudzuki (1953) studied the nature of
non-response in a two category (yes-no) questionnaire. This was done by
administering the same test to the same group with additional categories
such as "in-between," "cannot decide," and with two different intensities
of "agree" and "disagree." The latter method significantly reduced the
percentage of non-response. Ghiselli (1939) noted that respondents given
yes-no responses generally rated a product's advertising as less sincere
than when a four-step scale was used. He felt that people were more
willing to respond to a four-step scale. Hughes (1969) concluded that the
use of forced choice scales results in a confounding of indifference and
awareness.

Efficiency of response, or the ability to use scale points in
discriminating among objects and/or attributes, and response style (such
as yeasaying) have also been examined in relation to the number of response
alternatives. Matell and Jacoby (1972) determined that the proportion of
scale used was independent of the number of response alternatives. Several
studies suggest that yeasaying tendencies, as measured by the proportion
of positive responses, are unaffected by scale length (Goldsamt, 1972;
Saunders & Ward, 1964; Tuckman & Lorge, 1953).

The literature reviewed in this section is mostly subject to the
limitation of being drawn from college student populations. Subjects
of different educational, occupational, and age levels may be less
predisposed to respond to fine scales.




Scoring and data analysis considerations. Scoring and data analysis
considerations may affect the selection of the number of response
alternatives to be used in any given study. Several studies compared
dichotomous/trichotomous scoring methods to the normal summated scoring
procedures and reached the conclusion that results (differences in
attitudes) were not affected by the method of scoring (Matell, 1970;
Matell and Jacoby, 1971; and Goldsamt, 1972). However, for these specific
investigation situations two or three response alternatives might have
been the optimal number to employ.
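
The comparison of dichotomous or trichotomous scoring with summated
scoring can be sketched as follows. The collapsing rule is an assumption
made for illustration (ratings above the scale midpoint count as
positive, ratings below as negative, and in the two-category case the
midpoint is grouped with the negative side); the cited studies may have
used other cut points, and the function names are hypothetical.

```python
def collapse(rating, n_points, n_categories):
    """Collapse a rating on a 1..n_points scale into 2 or 3 categories."""
    midpoint = (n_points + 1) / 2
    if n_categories == 2:
        # Assumed rule: only ratings above the midpoint count as positive.
        return 1 if rating > midpoint else 0
    if n_categories == 3:
        if rating > midpoint:
            return 2
        if rating < midpoint:
            return 0
        return 1
    raise ValueError("n_categories must be 2 or 3")


def summated_score(ratings):
    """Ordinary summated (Likert-type) score over all items."""
    return sum(ratings)


def collapsed_score(ratings, n_points, n_categories):
    """Summated score after each item is collapsed."""
    return sum(collapse(r, n_points, n_categories) for r in ratings)


# Hypothetical responses of one subject to five 5-point items.
ratings = [5, 4, 3, 2, 1]
full = summated_score(ratings)
two_way = collapsed_score(ratings, 5, 2)
three_way = collapsed_score(ratings, 5, 3)
```

Correlating the collapsed scores with the full summated scores across
subjects would show how much information the coarser scoring discards in
a particular data set.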

Several problems with a two or three way scoring procedure exist
which are statistical in nature. If Chi Square tests are sufficient,
two or three categories might be adequate. However, if nonparametric
rank order correlations are to be employed, substantial "ties" on ranks
will result. Also, if parametric statistics are to be employed, the more
alternatives the better, because of the assumption of continuous
distributions or interval scale properties. Finally, another analytical
issue of concern with the use of two or three point scales is the
reproducibility of the original data configuration, an issue important in
the use of multidimensional scaling. Using simulated data, Green and Rao
(1970) demonstrated that recovery is poor with two or three alternatives,
and that diminishing returns set in rapidly beyond eight alternatives.
Other considerations related to scoring questionnaires are discussed in
Chapter XI.
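
The remark about ties in rank order statistics can be made concrete by
counting how many pairs of subjects receive the same score. The sketch
below is illustrative only; the function name tied_pair_fraction and the
two small data sets are hypothetical.

```python
from collections import Counter


def tied_pair_fraction(scores):
    """Fraction of all subject pairs that share the same score.
    Requires at least two scores."""
    n = len(scores)
    total_pairs = n * (n - 1) // 2
    tied_pairs = sum(c * (c - 1) // 2 for c in Counter(scores).values())
    return tied_pairs / total_pairs


# Six hypothetical subjects rated on a two-point and a nine-point scale.
two_point = [1, 1, 2, 2, 1, 2]
nine_point = [1, 3, 4, 6, 7, 9]
coarse_ties = tied_pair_fraction(two_point)
fine_ties = tied_pair_fraction(nine_point)
```

With only two categories a large share of the pairs is tied, which is
what weakens rank order correlations computed on such data.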

Summary and Conclusions. The state of the literature was probably
best summarized by Ghiselli and Brown in Personnel and Industrial Psychology
(1948) and by Guilford in Psychometric Methods (1954). These authorities
contend that the optimal number of response alternatives is a matter for
empirical determination in any situation, and suggest that there is a wide
range of variation in refinement around the optimal point within which
reliability changes very little. It would appear, however, that additional
research in the area might be warranted covering: the different types of
rating scales; various topical areas of research; and subjects with
different ability, educational, and sociodemographic characteristics. From
such studies more information would be available regarding the optimal
number of response alternatives to employ for any specific investigation
situation.


Response  Anchoring 

This section contains a summary of research findings concerning
response anchoring, including: types of response anchors; anchored versus
unanchored scales; amount of verbal anchoring; selection procedures for
verbal scale anchors; and balanced versus unbalanced scales.

Types of response anchors. The researcher's judgment has typically
determined whether response anchors are to be verbal, numerical, graphic,
or some combination. In its original form, the semantic differential was
in the following graphic form (Osgood, Suci & Tannenbaum, 1957).
Respondents were instructed to place an X on the line that represented
their attitude.


Strong  : : : : : : : Weak 

This  represents  a combination  graphic  and  verbal  scale.  The  Likert 
method  calls  for  a verbal  rating  (strongly  agree  through  strongly  disagree) 
to a directional statement phrased either positively or negatively. For
example:

The Modern Volunteer Army (MVA) places too much emphasis on
extrinsic factors (e.g., beer in barracks) as opposed to intrinsic, job
related factors (e.g., pay, supervision).

__Agree Strongly  __Agree  __Undecided  __Disagree  __Disagree Strongly
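
Because Likert statements may be phrased either positively or
negatively, responses to negatively phrased statements are usually
reversed before they are summed. The sketch below assumes a 5-to-1
numerical coding of the five verbal alternatives shown above; the coding
and the function name score_item are assumptions for illustration and
are not given in the source.

```python
# Assumed numerical coding of the five verbal alternatives.
LIKERT_VALUES = {
    "Agree Strongly": 5,
    "Agree": 4,
    "Undecided": 3,
    "Disagree": 2,
    "Disagree Strongly": 1,
}


def score_item(response, negatively_phrased=False):
    """Return the item score; reverse it for negatively phrased
    statements so that a high score always means a favorable attitude."""
    value = LIKERT_VALUES[response]
    return 6 - value if negatively_phrased else value


# A subject who strongly agrees with a negatively phrased statement
# receives the lowest score on that item.
low = score_item("Agree Strongly", negatively_phrased=True)
high = score_item("Agree Strongly")
```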

Few studies were uncovered in the literature review which
systematically investigated the effect of type of response alternative
employed. In a study to validate consumer attitudes concerning various
brands against actual purchases, Abrams (1966) tested four combinations of
rating devices:

1. Verbal anchors with a -5 through +5 numerical continuum, e.g.:

   Definitely                                        Definitely
   Dislike                                           Like
     -5   -4   -3   -2   -1    0   +1   +2   +3   +4   +5

2. Verbal anchors with a 1 through 10 numerical continuum, e.g.:

   Definitely                                   Definitely
   Dislike                                      Like
      1    2    3    4    5    6    7    8    9   10

3. A verbal and numerical continuum, e.g.:

   Dislike     Dislike   Dislike   Neither Like  Like a  Like      Like
   Completely  Somewhat  a Little  nor Dislike   Little  Somewhat  Completely
       1          2         3           4           5       6          7

4. Verbal continua, e.g.:

   Below     About     A Little   A Lot    One of     None
   Average   Average   Better     Better   the Best   Better

Experimental findings illustrated that: average scale scores are
relatively constant regardless of scale type; scale 4 had a lower average
prediction error (the difference between predicted brand share and actual
consumer purchases); and scale 4 had a far smaller amount of clustering of
responses at the extreme positive position. These findings also confirm
the convention of researchers who do not include numerical response
alternatives in an attitude measurement scale.
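
The prediction error criterion used by Abrams can be illustrated with a
small computation. The sketch assumes that the average prediction error
is the mean absolute difference between predicted and actual brand
shares; the source defines the error only as the difference between the
two, so the use of absolute values, the function name, and the brand
figures below are all hypothetical.

```python
def average_prediction_error(predicted_share, actual_share):
    """Mean absolute difference between predicted and actual shares,
    taken over the brands in predicted_share."""
    errors = [abs(predicted_share[brand] - actual_share[brand])
              for brand in predicted_share]
    return sum(errors) / len(errors)


# Hypothetical brand shares (proportions of purchases).
predicted = {"Brand A": 0.50, "Brand B": 0.30, "Brand C": 0.20}
actual = {"Brand A": 0.40, "Brand B": 0.35, "Brand C": 0.25}
error = average_prediction_error(predicted, actual)
```

Computing this figure separately for each rating device would allow the
devices to be ranked by how closely attitudes track purchases.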


Several additional studies were found which support the use of verbal
anchoring and verbally defined response alternatives. In a study employing
Air Force personnel as subjects (Madden, 1964), three forms of rating
scales were used: (1) each scale alternative was verbally defined and
illustrated; (2) neither definitions nor examples were used (numerical
scale); and (3) definitions were used but examples were eliminated
(verbal scale). Forms (1) and (3) were equally reliable and of greater
reliability than form (2). Form (3) was preferred because it was simpler
and less time consuming for raters to use. Peters and McCormick (1966)
compared the effectiveness of job-task anchored (verbal) equal appearing
intervals scales and simple numerically anchored scales. The job-task
anchored scales were found to have significantly greater reliability.

Marsh  and  Perrin  (1925)  compared  the  effectiveness  of  the  graphic 
scale,  percentage  scale,  and  man-to-man  scale.  On  the  graphic  scale, 
raters  underscored  the  description  most  applicable  to  the  subject.  On 
the  percentage  scale  the  raters  placed  a check  mark  in  the  column  repre- 
senting the  subject's  standing,  in  terms  of  the  perceived  amount  of  a 
given  trait  possessed,  with  reference  to  a preliminary  group  of  subjects. 
With  the  man-to-man  scale  the  subjects  were  compared  with  particular  in- 
dividuals representing  the  standards  for  the  traits  rated.  The  results 
failed  to  demonstrate  the  superiority  of  any  one  form  of  scale,  with  the 
range  of  average  deviations  from  agreement  being  extremely  limited  regard- 
less of  the  form  of  scale  employed.  Ross  (1966)  compared  man-to-man  job 
performance  ratings  with  ratings  from  an  anchored  rating  scale  for  their 
validity  in  guiding  salary  decisions  in  a research  and  development  organ- 
ization. The  man-to-man  comparison  procedure  was  found  to  be  as  valid 
as  the  anchored  ratings.  However,  the  two  methods  diverged  in  important 
practical  ways  in  the  results  they  produced. 

Two  other  studies  reported  the  favorability  of  using  verbal  scale 
anchors,  although  neither  compared  verbal  to  other  types  of  anchors.  In 
a study of supervisory style of head nurses, Smith and Kendall (1963)
anchored  evaluative  rating  scales  with  examples  of  expected  supervisory 
behavior.  The  examples  were  selected  by  independent  consensus  of  a number 
of  head  nurses.  Scale  reliabilities  ranged  above  .97. 


Barrett,  Taylor,  Parker  and  Martins  (1958)  administered  four  rating 
scale  formats  varying  from  unstructured  to  highly  structured  in  nature. 
Second  line  supervisors  rated  clerical  workers.  The  verbal  format  employ- 
ing trait  titles  and  behavioral  descriptions  of  scale  steps  demonstrated 
higher  inter-rater  reliability,  less  halo,  and  less  leniency  than  did  the 
more  structured  or  less  structured  formats.  It  should  be  noted  that  this 
study  concerned  the  amount  of  verbal  cues  along  a scale. 

Based  upon  the  studies  reviewed  in  this  section,  it  appears  that 
empirical  support  exists  to  conclude  that  the  reliability  of  scales  with 
verbal  anchors  and  verbal  response  alternatives  is  superior  to  that  of 
numerical  and  other  combinations  of  verbal  and  numerical  scales.  Little 
evidence was provided regarding graphic rating techniques. It should be
noted that none of the studies addressed the issue of comparative validity
or subjects' preferences and/or ability to use the rating instrument.




Anchored  versus  unanchored  scales.  A number  of  studies  have  been 
conducted  on  the  topic  known  as  "anchoring  effects."  This  section  will 
focus  on  differences  in  research  results  caused  by  the  use  of  anchored 
versus  unanchored  scales.  It  should  be  noted  that  all  the  studies 
which  follow  compared  unanchored  scales  to  scales  with  one  end  anchored. 
Many of the studies varied the anchoring of the left or right end of the
scale.

An  early  study  in  anchoring  effects  in  the  judgment  of  verbal  mater- 
ials was  reported  by  McGarvey  (1943).  Subjects  were  asked  to  scale  state- 
ments about  the  social  prestige  of  occupations  and  undesirable  forms  of 
behavior.  Scales  used  in  the  experiment  were  either  unanchored  or  had 
either  one  of  the  two  extreme  points  verbally  anchored.  Results  indicated 
that in unanchored scales the absolute scale tended to be "stimulus anchored,"
i.e., anchored by the question stem. With either of the end points
anchored,  the  tendency  was  to  move  from  the  stimulus  value  on  the  absolute 
scale  toward  the  anchored  extreme.  This  anchoring  effect  has  been  confirm- 
ed by  Rogers  (1941)  and  Volkman  (1936).  Rogers'  study  also  examined  con- 
fidence in  ratings  and  judgment  time.  Confidence  was  only  slightly  affect- 
ed due  to  anchoring,  but  was  found  decreasing  in  higher  categories  nearer 
the anchor. Judgment time decreased with anchoring. In a reexamination
of Volkman's experiment, Hunt and Volkman (1937) were unable to conclude
with  certainty  that  an  anchor  effect  existed.  An  incomplete  shift  in  scale 
values  occurred.  That  is,  the  movement  did  not  include  the  subject's  own 
stimulus  anchor,  his  most  pleasant  color. 

Several  examples  of  conflicting  studies  to  the  above  anchor  effect 
investigations  are  available.  For  example,  Weiss  (1961)  used  two  separate 
experimental  groups  in  a study  concerning  attitude  toward  delinquents; 
the  experiments  cited  above  used  the  same  subjects  with  an  anchored  scale 
following  an  unanchored  rating.  One  group  was  given  an  extreme  statement 
as  a standard  for  a punitive  category,  and  the  other  was  given  no  stand- 
ard. A contrast  effect,  movement  in  ratings  away  from  the  anchor  state- 
ment, was  produced  by  the  extreme  standard.  Hunt  (1941)  offered  evidence 
supportive of this contrast effect. In conclusion to this experimental
study, he commented: "if judgments made with an unanchored scale be repeated
with the scale anchored by the further definition of one extreme,
there is a shift in the average value of the stimulus judgments, and this
shift is in a direction away from the anchoring value." This would mean
that when a scale is anchored at its low or negative extreme, the ratings
tend to rise or be more positive, and vice versa.

Frawley (1943) had 115 seminarians rate 100 statements about war,
belief  in  God,  birth  control,  etc.  His  experimental  procedure  was  similar 
to  the  previously  cited  studies  --  subjects  rated  the  statements  first 
on  an  unanchored  scale  and  the  next  day  rated  them  again  on  a scale  with 
the  most  unfavorable  end  of  the  scale  anchored.  The  fact  that  the 
Spearman  rank  order  correlations  were  extremely  high  for  the  two  sets  of 
data  indicated  minimal  presence  of  anchor  effects. 
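
As an illustration of the statistic relied on in this comparison, the
following minimal Python sketch computes a Spearman rank order correlation
between two sets of ratings. The function names and the six pairs of
ratings are hypothetical and are not data from the study; the sketch only
shows how a high coefficient arises when the ordering of statements is
preserved between the unanchored and anchored conditions.

```python
def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the value at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # ranks are 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman rank order correlation: Pearson r computed on the ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(x)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


# Hypothetical mean ratings of six statements under each condition.
unanchored = [2.1, 3.4, 5.0, 6.2, 7.8, 9.1]
anchored = [2.6, 3.9, 5.2, 6.0, 8.1, 9.4]
rho = spearman_rho(unanchored, anchored)
print(f"rho = {rho:.2f}")
```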




Because  of  this  conflicting  evidence  it  cannot  be  concluded  that 
use  of  a single  verbal  anchor  produces  anchor  effects,  contrast  effects, 
or  indeed  any  significant  differences  in  rating  scale  average  scores. 

Most  of  the  studies  seem  limited  due  to  the  use  of  the  same  subjects  in 
simple  before  (unanchored)  and  after  (anchored)  experimental  designs.  It 
must  also  be  noted  that  the  investigations  cited  in  this  area  were  theore- 
tical inquiries  into  common  principles  or  "laws"  governing  ratings  or  judg- 
ments, and  were  less  concerned  with  strengths  and  weaknesses  of  types  of 
scales. Finally, the fact that not one of the investigations cited employed
verbal anchors at both ends of the scale makes it even more difficult to
conclude  whether  anchored  or  unanchored  scales  should  be  employed. 

Amount  of  verbal  anchoring.  A few  research  studies  have  addressed 
the  issue  of  the  effects  of  varying  the  amount  of  verbal  anchoring  on 
rating scales. Bendig (1953) examined the impact of amount of verbal
anchoring  in  a study  where  225  college  students  rated  themselves  as  to 
how  much  they  knew  about  12  foreign  countries.  Alternative  scales  presen- 
ted to  subjects  had  the  center  category  defined,  the  end  categories  de- 
fined, or  both  center  and  end  categories  defined.  Results  indicated  that 
the  reliability  of  the  scales  increased  with  added  scale  anchoring.  In  a 
separate  report  of  the  same  data  base,  Bendig  and  Hughes  (1953)  concluded 
that  increased  verbal  anchoring  also  resulted  in  a slight  increase  in  the 
amount  of  information  transmitted  by  the  scale.  The  use  of  "information 
transmitted"  here  is  in  the  context  of  information  theory,  meaning  more 
descriptive  data  from  the  respondents. 
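
For readers unfamiliar with the information theory usage, the sketch below
computes "information transmitted" in the general Shannon sense, as the
mutual information (in bits) between the objects rated and the response
categories chosen. It is an illustration under stated assumptions: the
function name and the small frequency tables are hypothetical, and the
estimator actually used in the reports cited above may differ in detail.

```python
from math import log2


def information_transmitted(table):
    """Mutual information in bits for a frequency table.

    Rows are the rated objects, columns are the response categories, and
    each cell counts how often that category was chosen for that object.
    """
    total = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    bits = 0.0
    for i, row in enumerate(table):
        for j, count in enumerate(row):
            if count:  # empty cells contribute nothing
                p = count / total
                bits += p * log2(count * total / (row_totals[i] * col_totals[j]))
    return bits


# Hypothetical tables: two objects rated on a two-category scale.
perfectly_separated = [[10, 0], [0, 10]]
not_separated = [[5, 5], [5, 5]]
print(information_transmitted(perfectly_separated))
print(information_transmitted(not_separated))
```

A scale whose categories separate the rated objects cleanly transmits more
information than one on which every object draws the same spread of
responses.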

Abrams  (1966)  compared  seven  point  scales  with  only  the  end  categor- 
ies defined  to  scales  with  the  entire  continuum  verbally  anchored.  The 
study examined consumer mail panel respondents' attitudes toward national
brands  of  toothpaste  and  scouring  cleanser.  Shelf  inventories  of  actual 
purchases  of  products  in  these  categories  were  conducted  in  a follow-up. 
Scales  which  had  verbal  descriptors  for  all  response  alternatives  were 
better  predictors  of  purchase  behavior.  These  scales  also  displayed 
greater  respondent  use  of  the  range  of  response  alternatives,  with  less 
clustering  at  the  extreme  positive  position. 

Another  study  in  support  of  increased  verbal  anchoring  was  offered 
by Madden (1964). Four job evaluation factors were used as the basis of
rating 10 Air Force specialties. For each factor three different methods
of anchoring were used: (1) each response alternative was defined and
illustrated; (2) neither definitions nor examples were used; and (3) definitions
were used but examples were omitted. Methods (1) and (3) were
approximately  equal  in  reliability,  both  yielding  more  reliable  scales 
than  Method  (2). 

Only  one  empirical  study  was  uncovered  which  offers  evidence  some- 
what in  conflict  with  the  above  findings.  Carter,  Ruggels,  and  Chaffee 
(1968)  conducted  an  experiment  using  15  semantic  differential  scales  used 
to describe 12 objects (concepts about schools). One hundred and thirty-five
female teachers were given the opportunity to modify the scales during
rating. Assuming that not every adjective scale was a useful descriptor,
subjects were given one polar adjective for each scale and were asked to
fill in the appropriate opposite or note "wouldn't use scale" or "don't
know."  The  authors  concluded  that  for  four  of  the  15  scales,  the  polar 
opposite  chosen  by  the  subjects  was  not  suggested  by  Osgood.  Also,  the 
authors  noted  that  whatever  the  merits  of  anchoring  both  ends  of  every 
scale  to  measure  meaning,  it  appeared  that  subjects  can  more  accurately 
devote  their  descriptions  to  objects  when  one  end  of  the  scale  is  left 
for  them  to  describe.  Perhaps  this  study  raises  more  doubts  about  the 
validity  of  the  semantic  differential  technique  than  it  offers  concrete 
evidence  regarding  the  amount  of  verbal  anchoring. 

In  conclusion,  the  limited  number  of  research  studies  cited  above 
are  somewhat  consistent  in  reporting  greater  scale  reliability  with  added 
verbal  anchoring.  Also,  one  experiment  (Abrams,  1966)  offered  evidence 
of  higher  predictive  validity  with  more  verbal  anchoring. 

Selection  procedures  for  verbal  scale  anchors.  This  section  presents 
literature  which  dealt  with  appropriate  procedures  for  the  selection  of 
verbal  scale  anchors.  The  studies  relate  to  verbal  anchors  used  in 
Likert  scales,  semantic  differential  scales,  and  rating  scales. 


A complete bipolar adjective screening methodology for semantic differential
scales has been outlined by Lusk (1973). This procedure seems
highly  applicable  to  any  research  considering  the  use  of  the  semantic  dif- 
ferential. The  process  suggested  is  as  follows: 

1.  Select  from  Osgood's  Thesaurus  Study  a set  of  bipolar  adjectives 
for  each  factor  dimension,  evaluative,  potency,  and  activity,  appli- 
cable to  the  study. 

2.  Select  a pretest  sample  representative  of  the  final  population  in 
the  study. 

3.  Prepare  the  pretest  concept/adjective  test  blocks,  randomizing  the 
concepts  and  bipolar  adjectives. 

4. Administer the pretest semantic differential scales and compare
variances (test for differences) from the scale midpoint. The
objective here is to eliminate a preponderance of midinterval
responses for each bipolar adjective for each concept evaluated.

5.  Order  the  bipolar  adjectives  based  upon  their  respective  vari- 
ances, high  to  low. 

6. Select the required number of bipolar adjectives; those sets
with significantly lower variances from midpoints (F test) may be
eliminated.
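
Steps 4 through 6 can be sketched in Python as follows. This is a minimal
illustration, not the published procedure: the function names, the
seven-point scale with midpoint 4, and the pretest ratings are all
hypothetical, and "variance from the scale midpoint" is interpreted here as
the mean squared deviation of the ratings from the midpoint. The printed
ratio would still have to be compared against a critical value of F for the
pretest sample size, which is not done here.

```python
def variance_about_midpoint(ratings, midpoint):
    """Mean squared deviation of the ratings from the scale midpoint."""
    return sum((r - midpoint) ** 2 for r in ratings) / len(ratings)


def rank_adjective_pairs(pretest, midpoint):
    """Order adjective pairs by variance about the midpoint, high to low."""
    scored = [
        (pair, variance_about_midpoint(ratings, midpoint))
        for pair, ratings in pretest.items()
    ]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Hypothetical pretest ratings on a seven-point scale (midpoint = 4).
pretest = {
    "good-bad": [1, 2, 7, 6, 1, 7, 2, 6],
    "strong-weak": [3, 4, 5, 4, 3, 5, 4, 4],
    "active-passive": [2, 6, 5, 3, 2, 6, 4, 4],
}

ranked = rank_adjective_pairs(pretest, midpoint=4)
top_variance = ranked[0][1]
for pair, variance in ranked:
    # Ratio of the largest variance to this pair's variance; a large ratio
    # marks a pair whose responses cluster at the midpoint.
    ratio = top_variance / variance if variance else float("inf")
    print(f"{pair:15s} variance = {variance:.2f}   ratio to top = {ratio:.2f}")
```

Pairs at the bottom of the ordering, whose responses pile up at the
midpoint, are the candidates for elimination.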

One  additional  insight  into  the  selection  of  bipolar  adjectives 
should  be  mentioned.  Carter,  Ruggels,  and  Chaffee  (1968)  reported  that 
bipolar  adjectives  such  as  sweet-ferocious  were  of  little  value  in  rating 
inanimate objects, such as: "Are boulders sweet-sour?" But they were
useful with relational concepts. Again, caution must be exercised in
selecting  bipolar  adjectives  or  phrases. 


Smith  and  Kendall  (1963)  tested  a procedure  for  the  construction  of 
evaluative  rating  scales  anchored  by  examples  of  expected  behavior. 
Expectations,  based  on  having  observed  similar  behavior,  were  used  to 
permit  rating  in  a variety  of  situations.  Examples,  submitted  by  head 
nurses  as  illustrations  of  nurses’  behavior  related  to  a given  dimension, 
were  retained  only  if  reallocated  to  that  dimension  by  other  head  nurses. 
They  were  then  scaled  as  to  desirability.  Agreement  for  a number  of 
examples  was  high,  and  scale  reliabilities  ranged  above  .97. 

In  general,  the  interpretation  of  the  above  studies  is  that  pretests 
for  the  selection  of  verbal  anchors  are  valuable  in  building  scale  content 
validity  and  reliability.  Rather  than  employing  anchors  which  seem  appro- 
priate to  the  researcher,  anchors  should  preferably  be  selected  by  respon- 
dents similar  to  those  who  will  be  participating  in  the  study. 

Balanced  versus  unbalanced  scales.  Historically,  the  balanced  scale 
has  been  preferred  by  researchers.  A scale  is  balanced  when  it  has  an 
equal number of response alternatives on either side of the scale's
"indifferent" category. For example, the following verbal scale is balanced:

How  would  you  describe  the  Volunteer  Army? 

Very  Progressive 

Progressive 

Moderately  Progressive 

Neither  Progressive  nor  Conservative 

Moderately  Conservative 

Conservative 

Very  Conservative 

Unbalanced  scales  have  been  employed  when  pretest  results  demonstrated 
that  subjects,  by  using  extreme  response  alternatives  on  a scale,  produced 
a skewed  distribution  of  responses  rather  than  the  statistically  desirable 
normal  distribution  around  the  mean  attitude.  To  minimize  "end  piling," 
unbalanced  scales  have  been  used.  More  response  alternatives  are  added  to 
the end of the scale where the piling is likely to occur. This practice
tends to shift the distribution of responses along the scale continuum.
For example, the following scale is heavily unbalanced on the favorable
end:


What  is  your  reaction  to  the  "beer  in  the  barracks"  policy? 

Enthusiastic 

Extremely  Favorable 

Very  Favorable 

Favorable 

Fair 

Poor 
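
The definition of balance given above can be expressed as a simple check.
The Python sketch below is illustrative only: the function name is
hypothetical, and treating "Fair" as the nearest thing to an indifferent
category in the second scale is an assumption made here, since that scale
has no explicitly neutral alternative.

```python
def is_balanced(alternatives, neutral):
    """True when equally many alternatives lie on each side of `neutral`."""
    position = alternatives.index(neutral)
    return position == len(alternatives) - position - 1


volunteer_army = [
    "Very Progressive",
    "Progressive",
    "Moderately Progressive",
    "Neither Progressive nor Conservative",
    "Moderately Conservative",
    "Conservative",
    "Very Conservative",
]
beer_policy = [
    "Enthusiastic",
    "Extremely Favorable",
    "Very Favorable",
    "Favorable",
    "Fair",  # assumed stand-in for an indifferent category
    "Poor",
]

print(is_balanced(volunteer_army, "Neither Progressive nor Conservative"))  # True
print(is_balanced(beer_policy, "Fair"))  # False
```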

Only  one  empirical  study  was  found  in  the  literature  review  which 
dealt  with  the  comparative  effects  of  balanced  versus  unbalanced  scales. 
In the study (Weiss, 1963b), 350 college students judged the social
prestige of 400 occupations. Four types of scales were examined, each
with  a zero  category  designating  average  prestige.  They  were: 

1. Three category balanced with an equal number of plus and minus
categories.

2. Seven category balanced with an equal number of plus and minus
categories.

3.  Five  category  unbalanced  with  a single  plus  category. 

4.  Five  category  unbalanced  with  a single  minus  category. 

The  author  concluded  that  relative  to  the  balanced  scales,  the  unbalanced 
scales  induced  a shift  in  the  prestige  value  of  the  "average"  category  in 
the  direction  of  the  single-nondiscriminating  category.  In  other  words, 
significant  differences  between  scales  occurred.  Unfortunately,  this  in- 
vestigation did  not  report  data  relating  to  the  comparative  reliability  or 
validity  of  balanced  and  unbalanced  scales. 

Based  upon  a single  study,  obviously  no  conclusions  can  be  drawn  re- 
garding the  use  of  balanced  or  unbalanced  scales.  Intuitively,  the  use 
of  balanced  scales  seems  to  be  warranted  to  avoid  biasing  results  with  the 
presence  of  more  favorable  (or  unfavorable)  response  categories. 




ORDER  OF  PERCEIVED  FAVORABLENESS  OF  COMMONLY 
USED  WORDS  AND  PHRASES 


This  chapter  is  concerned  with  the  words  or  phrases  which  are 
commonly  used  as  response  alternatives  in  questionnaire  items.  It  is 
often  necessary  to  arrange  the  words  or  phrases  along  some  continuum  or 
in  some  order  of  degree,  and  several  studies  have  been  conducted  to  estab- 
lish this  order.  These  studies  will  be  discussed  in  this  chapter.  Studies 
concerned  with  determining  the  perceived  favorableness  of  words  and  phrases 
are  described  below  in  terms  of  the  instruments  used,  type  and  number  of 
subjects,  and  method  of  determining  the  scale  values.  When  available,  lists 
of  words  or  phrases  and  scale  values  have  been  included. 


Major  Studies  and  Lists  of  Adjectives  and  Scale  Values 


One of the first studies on the perceived favorableness of adjectives was
conducted by Mosier (1940, 1941a). In this study 296 adjectives were rated
on an 11 point scale anchored at 1 with "most unfavorable," at 6 with
"neither favorable nor unfavorable," and at 11 with "most favorable." Zero
was used if the adjective could not be rated. Each adjective or adjective
phrase was judged by approximately 140 students from introductory or second
year psychology courses. Twenty-six of the 296 words were scaled by
Thurstone's method of successive intervals, using the stimulus "completely
unsatisfactory" as the standard, with its mean at zero and its standard
deviation equal to one. The medians, scale values, and standard deviations
for these 26 words are given in Table VII-1 (Mosier, 1940). The method of
equal appearing intervals was also used to find the scale value for each of
the 296 words. A sample list of 14 words is shown in Table VII-2. In this
study correlation coefficients were computed on six words (neutral, normal,
excellent, desirable, disgusting, and unsatisfactory) which were repeated
in the list presented to the subjects. The correlation coefficients for
these words ranged from .90 to .99. In Mosier's list there were 26 words
that 20 or more subjects marked "unable to rate." These words are listed in
Table VII-3. Some of the words also exhibited marked bimodality of response,
and these are shown in Table VII-4. Complete tables showing the results of
this study were privately issued by Mosier (1941b), but this list was
unavailable for review.
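
The bookkeeping described in this paragraph can be sketched in Python. The
sketch assumes that, as is conventional for the method of equal appearing
intervals, a word's scale value is taken as the median of the judges'
ratings; the function name, its parameters, and the ratings are
hypothetical and are not taken from the study. A rating of zero is treated
as "unable to rate," and a word is flagged when the count of such responses
reaches the threshold of 20 mentioned above.

```python
from statistics import median


def scale_word(ratings, unable_code=0, unable_limit=20):
    """Summarize one word's ratings on the 11 point scale.

    Ratings equal to `unable_code` are excluded from the median and are
    counted separately as "unable to rate" responses.
    """
    rated = [r for r in ratings if r != unable_code]
    unable = len(ratings) - len(rated)
    return {
        "median": median(rated) if rated else None,
        "unable": unable,
        "flagged": unable >= unable_limit,
    }


# Hypothetical ratings from eight judges; two could not rate the word.
summary = scale_word([9, 10, 10, 11, 9, 10, 0, 0])
print(summary)
```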


Mosier's research also studied the effect of usual adverbial intensives.
A set of five words was selected and these words were repeated with each
of seven intensives. The results of the study are given in Table VII-5.
Four of the five adjectives selected are arranged across the top of the
table, each heading a column. The fifth adjective, "indifferent," behaved
atypically because of ambiguous associated context. Each row of the table


TABLE VII-1

Scale Values of Standard Set of Words
(Mosier, 1940)

Stimulus                      Md.           Scale Value   S.D.

Completely unsatisfactory     1.6           0.00          1.00
Very unsatisfactory           [illegible]   0.75          [illegible]
Catastrophic                  2.5           0.01          0.81
Treacherous                   2.7           1.05          0.62
Menacing                      [illegible]   1.14          0.96
Discouraging                  3.5           1.42          0.49
Painful                       [illegible]   1.43          0.54
Unprofitable                  4.3           1.72          0.62
Rejected                      4.6           1.71          0.54
Disputable                    5.7           2.42          9.69
Normal                        [illegible]   2.47          1.46
Satiating                     6.2           [illegible]   1.54
Reconcilable                  6.3           3.30          9.75
Blameless                     7.6           3.64          0.90
Solacing                      [illegible]   [illegible]   0.51
Ordinary                      6.5           3.63          1.43
Bonny                         2.4           3.97          0.61
Decent                        [illegible]   4.06          0.61
Preferable                    9.0           4.30          [illegible]
Profitable                    9.4           4.40          0.47
Popular                       9.7           4.55          0.49
Successful                    10.0          4.65          9.54
Sublime                       10.3          4.00          [illegible]
Superior                      10.4          4.91          [illegible]
Completely agreeable          10.1          4.95          [illegible]


TABLE VII-2

Scale Values of Selected Words
(Mosier, 1941a)

Stimulus                      Scale Value

Completely unsatisfactory     0.00
Repulsive                     0.50
Disgraceful                   1.00
Wrong                         [illegible]
Unnecessary                   2.00
Disputable                    2.54
Excusable                     [illegible]
Average                       1.04
Pardonable                    [illegible]
Comfortable                   [illegible]
Desirable                     4.50
Highly agreeable              5.02
Divine                        [illegible]
Very, very desirable          5.44



TABLE  VII- 3 

Words  Marked  "Unable  to  Rate"  by  20  or  More  Subjects 
(Mosier  1941a) 


Abhorred 

Adverse 

Bonny 

Calamitous 

Cloying 

Debased 

Despicable 


Ecstatic 

Estimable 

Expedient 

Inflaming 

Iniquitous 

Noxious 

Odious 


Ominous 

Peerless 

Pernicious 

Persuasive 

Perverse 

Pestilential 


Propitious 

Satisfying 

Seductive 

Seemly 

Solacing 

Superlative 


TABLE VII-4

Words Exhibiting Marked Bimodality of Response
(Mosier 1941a)

Acceptable
Amazing
Appalling
Bearable
Bewitching
Choice
Important
Indifferent
Completely indifferent
[illegible] indifferent
Highly indifferent
Quite indifferent
Unusually indifferent
Very indifferent
Very, very indifferent
Inflaming
Indispensable
Irresistible
Normal*
Peerless
[illegible]
Seductive
Sublime
Tempting
[illegible]
Unspeakable

Note. Words marked with asterisks also appear in Table VII-1.


TABLE VII-5

Scale Values as Affected by Adverbial Modifiers
(Mosier 1941a)

Modifier        Desirable     Agreeable     Poor          Unsatisfactory

(none)          4.50          4.19          1.60          1.47
[illegible]     4.76          4.45          1.50          [illegible]
Very            4.96          4.12          1.11          0.75
Unusually       5.21          4.56          0.95          0.75
Completely      5.14          4.96          0.92          0.00
Highly          [illegible]   5.02          [illegible]   [illegible]
[illegible]     5.42          5.10          0.95          0.10
Very, very      5.66          5.14          0.55          0.25



presents  the  scale  values  for  one  of  the  adverbial  modifiers  studied. 

Jones  and  Thurstone  (1955),  in  order  to  determine  the  degree  of  like 
or  dislike  denoted  by  an  adjective  or  phrase,  had  905  enlisted  personnel 
rate  51  descriptive  words  and  phrases  on  a nine  point  scale  anchored  with 
"greatest  dislike"  at  the  left,  "neither  like  nor  dislike"  in  the  center, 
and "greatest like" at the right. For each item a scale value was determined
by the method of successive intervals and a standard deviation
was  computed.  The  51  word  phrases  are  given  in  Table  VII-6. 

Myers  and  Warner  (1968)  conducted  a study  in  which  50  commonly  used 
statements  describing  product  taste  or  ad  effectiveness  were  rated  on  a 
21 point Thurstone equal interval scale with the top category captioned
"This is the best thing I could say about the (person, product, or ad)."
The bottom and opposite category was "This is the worst thing I could say
about  the  (person,  product,  or  ad)."  The  judges  were  25  housewives,  36 
business  executives,  40  graduate  business  administration  students,  and  25 
undergraduate  business  administration  students.  For  each  statement  the 
mean  scale  values  and  standard  deviations  were  computed.  The  50  statements 
are  given  in  Table  VII-7. 

Cliff  (1959)  reported  on  a study  which  derived  scale  values  for  150 
evaluative  words  and  phrases.  The  list  of  stimuli  used  15  unmodified 
adjectives  plus  all  combinations  of  them  and  nine  intensity  adverbs.  Two 
hundred  thirteen  students  in  introductory  psychology  courses  at  Wayne 
State  University,  183  at  Princeton,  and  129  at  Dartmouth  rated  the  words 
and  phrases,  on  an  11  point  scale  anchored  by  "most  unfavorable"  at  the 
left, "neutral" in the center, and "most favorable" at the right. The
referent  of  the  items  was  "favorable  or  unfavorable  opinions  about  people." 
Scale  values  were  derived  by  the  least  squares,  successive  interval  method. 
The  scale  values  of  the  adverb-adjective  combinations  are  shown  in  Table 
VII-8. The adverb and adjective values matrices are shown in Table VII-9.

Altemeyer (1970) conducted two studies in which numerical values were
assigned to adverb-verb combinations. In the first study, 392 introductory
psychology students rated eight adverb-verb combinations on a seven point
scale with values from minus three to plus three. In the second study,
194 introductory psychology students assigned numerical values to nine
adverb-verb  combinations  on  a four  point  scale  ranging  from  zero  to  plus 
three.  Plus  three  was  labeled  either  "completely  agree"  or  "strongly  agree." 
The  mean  ratings  of  the  verbal  phrases  obtained  for  both  studies  are  listed 
in  Table  VII-10. 

Dodd  and  Gerberick  (1960)  presented  sets  of  word  phrases  to  groups 
of  subjects  who  were  to  place  each  item  on  a nine  point  scale.  For  each 
group of words the median scale position was calculated. Table VII-11
shows  the  scale  positions  for  34  phrases  rated  by  40  subjects.  Table 
VII-12  shows  the  median  scale  positions  for  47  intensity  phrases  tested 
in  series  context.  Table  VII-13  shows  the  findings  from  100  judges  for 




TABLE VII-6

Scale Values and Standard Deviations of Stimulus Items
(Jones and Thurstone, 1955)

Item                     Scale Value   SD

Best of all              6.15          2.48
Favorite                 [illegible]   2.11
Like extremely           4.16          1.62
Like intensely           4.05          1.59
Excellent                3.71          1.01
Wonderful                3.51          .97
Strongly like            2.90          .69
Like very much           2.91          .60
Mighty fine              2.88          .67
Especially good          2.86          .82
Highly favorable         2.81          .60
Like very well           2.00          .78
Very good                2.50          .87
Like quite a bit         2.32          .52
Enjoy                    2.21          .86
Preferred                1.98          1.17
[illegible]              1.91          .76
Welcome                  1.77          1.18
Tasty                    1.76          .92
Pleasing                 1.58          .05
Like fairly well         1.51          .59
Like                     1.35          .77
Like moderately          1.12          .61
OK                       .87           1.24
Average                  [illegible]   1.08
Mildly like              .85           .47
Fair                     .78           .85
Acceptable               .73           .66
Only fair                .71           .64
Like slightly            .69           .32
Neutral                  .02           .18
Like not so well         -.30          1.07
Like not so much         -.41          .94
Dislike slightly         -.59          .27
Mildly dislike           -.74          .35
Not pleasing             [illegible]   .67
Don't care for it        -1.10         [illegible]
Dislike moderately       -1.20         .41
Poor                     [illegible]   .87
Dislike                  -1.58         .94
Don't like               -1.81         .97
Bad                      -2.02         .80
Highly unfavorable       -2.16         1.17
Strongly dislike         [illegible]   [illegible]
Dislike very much        -2.49         .64
Very bad                 -2.53         [illegible]
Terrible                 -3.09         [illegible]
Dislike intensely        -3.33         1.39
Loathe                   -3.76         3.54
Dislike extremely        -4.32         1.86
Despise                  [illegible]   3.62


TABLE  VII -7 


Means  and  Standard  Deviations  of  Commonly  Used  Statements 
(Myers  and  Warner,  1968) 


[Table body illegible in this copy. For each statement the table gives the
mean (M) and standard deviation (SD) of the ratings. Statement labels that
remain recoverable include: Fantastic, Tremendous, Outstanding,
Exceptionally good, Extremely good, Very good, Good, Moderately good,
Pleasant, Reasonably good, Nice, Fairly good, Slightly good, Acceptable,
Average, All right, O.K., So-so, Neutral, Fair, Mediocre, Not very good,
Moderately poor, Reasonably poor, Slightly poor, Poor, Fairly poor,
Unpleasant, Quite poor, Bad, Very bad, Very poor, Remarkably poor,
Unacceptable, Exceptionally poor, Extremely poor, Awful, Terrible.]

1 

7ft 

iP 

77  1 

7 

t X 

.7 

.61) 

05 

tl 

41) 

1 

MS 

1 

'4  1 

llftf  rtHi 

1 

IS 

(P 

K7  1 

1 

X X 

, 1 

51  ) 

1 

67 

1 1 

15) 

7 

(M) 

1 1 

-4,5' 

VII-6 


Obtained  Successive  Intervals  Scale  Values  of  Adverb-Adjective  Combinations 


TABLE  VII-9 


Adverb  and  Adjective  Value  Matrices 
(Cliff,  1959) 

                     Wayne              Princeton            Dartmouth
                Actual  Expected    Actual  Expected     Actual  Expected
Adverb           Value     Value     Value     Value      Value     Value

(Unmodified)     1.000      .987     1.000      .993      1.000      .991
Slightly          .555     1.000      .559      .999       .538     1.003
Somewhat          .685      .997      .719     1.001       .662      .995
Rather            .846     1.015      .887     1.014       .843     1.016
Pretty            .935      .995      .961      .994       .878      .992
Quite            1.042      .994     1.109      .988      1.047      .991
Decidedly        1.216      .997     1.231      .996      1.165      .992
Unusually        1.291     1.010     1.324     1.001      1.281     1.010
Very             1.317     1.008     1.323     1.007      1.254     1.002
Extremely        1.593      .996     1.546      .997      1.446     1.006

Adjective

Evil            -1.246     2.082     -.989     1.918      -.993     1.972
Wicked          -1.158     1.952     -.951     1.848      -.997     1.910
Contemptible     -.913     1.746     -.826     1.749      -.882     1.792
Immoral         -1.177     1.936     -.931     1.878      -.954     1.910
Disgusting       -.806     1.617     -.801     1.621      -.902     1.715
Bad             -1.025     2.032     -.972     2.051      -.796     1.907
Inferior         -.813     2.008     -.923     2.077      -.861     2.037
Ordinary         -.078     2.083     -.253     2.100      -.223     2.182
Average          -.040     2.121     -.296     2.254      -.211     2.195
Nice             1.007     1.742      .984     1.842      1.011     1.739
Good             1.078     1.752     1.158     1.777      1.075     1.761
Pleasant         1.001     1.835     1.050     1.856       .974     1.860
Charming          .802     2.136      .895     2.116       .910     2.013
Admirable         .983     2.001     1.170     1.892      1.086     1.892
Lovable           .836     2.173      .912     2.108       .812     2.207

TABLE VII-10


Numerical Ratings of Adverb-Verb Combinations
(Altemeyer, 1970)


                         Study 1                   Study 2
                  Disagree        Agree             Agree
Adverb             M      SD      M      SD         M      SD

Slightly          -.64    .38     .67    .36        .62    .31
Substantially    -2.17    .51    2.10    .50       2.08    .49
Moderately       -1.35    .42    1.47    .41       1.49    .38
Somewhat          -.93    .47     .94    .41        .91    .42
Quite            -2.16    .57    2.37    .49       2.23    .46
Considerably     -2.17    .45    2.21    .42       2.18    .40
Perhaps           -.43    .46     .52    .46        .44    .43
Decidedly        -2.76    .43    2.77    .41       2.74    .47
Mildly             .98    .41

[Only one M/SD pair survives for "Mildly" in the source copy; the
column to which it belongs is not recoverable.]

TABLE VII-11


Scale Positions for Thirty-four Phrases
(Dodd and Gerberick, 1960)


Degree phrases, tested
out-of-context                 Median

complete                       8.1[?]
almost complete                8.06
very much more                 8.02
much more                      7.67
a lot more                     7.50
a good deal more               7.29
more                           6.35
somewhat more                  6.25
a little more                  6.00
slightly more                  5.9[?]
now                            5.03
AS AT PRESENT                  5.00
slightly less                  3.97
a little less                  3.96
somewhat                       3.7[?]
less                           3.64
much less                      2.55
a good deal less               2.44
a lot less                     2.36
very little                    2.08
almost none                    2.04
very much less                 1.96
none                           1.11


Temporal frequency phrases,
tested out-of-context          Median

always                         8.99
without fail                   8.89
often                          7.23
usually                        7.17
frequently                     6.92
now and then                   4.79
sometimes                      4.78
occasionally                   4.13
seldom                         2.4[?]
rarely                         2.08
never                          1.00

[A bracketed question mark marks a digit that is illegible in the
source copy.]


Scale Positions of 47 Intensity Phrases
(Dodd and Gerberick, 1960)


TABLE VII-13

Stability of Intensity Phrases in Diverse Contexts
(Dodd and Gerberick, 1960)


Intensity                  Number of      Scale       Mean scale
Phrase           Issue     responses      position    position

Very strongly      1          295           8.96
                   2          197           8.91         8.92
                   3          161           8.91

Strongly           1          269           7.01
                   2          162           7.20         7.11
                   3          271           7.12

Moderately         1          305           4.78
                   2          189           4.77         4.82
                   3      [illegible]       4.92

Indifferent               (Insufficient data)


the phrases of Subset 1 of Table VII-12 on strength of feeling, when
presented in graded series and as applied to 31 scale statements about
three issues. The three issues were, respectively: resistance to
starting a war; drafting of women for military service and defense
work; and amount of government control. As seen in the table, Dodd and
Gerberick found that the diversity of context does not appreciably
shift the scores of the intensity phrases.

In a study conducted by Anderson (1968), a sample of 100 college
students rated 555 personality trait words on likableness as a
personality characteristic. The words were rated on a seven-point
scale, with zero defined as "least favorable or desirable" and 6 as
"most favorable or desirable." The words were also rated for
meaningfulness by 50 subjects on a scale that ranged from zero ("I have
almost no idea of the meaning of this word") to 4 ("I have a very clear
and definite understanding of the meaning of this word"). Table VII-14
shows the list of words in order of likableness. The first entry for
each word is its likableness value, listed in the column headed "L".
The L value is the sum of the ratings of the 100 subjects, so the mean
rating may be obtained by inserting a decimal point two places from the
right (that is, dividing by 100). The second entry, in the column
headed s², is the variance of the likableness ratings.
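The conversion from a tabled L value to a mean rating can be sketched as follows. This is only an illustration of the arithmetic described above; the function name `mean_likableness` is hypothetical, and the three L values are those printed for "liar", "phony", and "cruel" in Table VII-14.

```python
def mean_likableness(l_value: int, n_raters: int = 100) -> float:
    """Convert a tabled L value (the sum of all raters' 0-6 ratings)
    into the mean rating by dividing by the number of raters."""
    return l_value / n_raters


# L values as printed in Table VII-14 for three of the least likable words.
tabled_l = {"liar": 26, "phony": 27, "cruel": 40}

# Mean likableness on the 0-6 scale for each word.
means = {word: mean_likableness(l) for word, l in tabled_l.items()}

for word, mean in means.items():
    print(f"{word}: L = {tabled_l[word]}, mean = {mean:.2f}")
```

Because every word was rated by the same 100 subjects, the ordering of the words by L is identical to their ordering by mean rating.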

In a recent study (Matthews, Wright, & Yudowitch, 1975), a list of 141
adjective phrases showing degrees of adequacy, acceptability, and
comparison was administered to enlisted men and officers at Fort Hood.
The adjective phrases were rated on an eleven-point scale, with -5
anchored with "most unfavorable", zero anchored with "neither
unfavorable nor favorable", and +5 anchored with "most favorable".
Means and standard deviations were computed for each adjective phrase.
There were about 50 usable judgments for each phrase. The results of
this study are shown in Tables VII-15, VII-16, and VII-17.

Currently, the U.S. Army Test and Evaluation Command (1973) is
carrying out a project, part of which included the scaling of 32
adjectives and adjective phrases. Average scale scores and standard
deviations were computed for each item in the list. The 32 adjectives
and adjective phrases are shown in Table VII-18.

Simpson (1944) studied the commonly held meaning of 20 words denoting
frequency by having 335 high school and college students respond with
how many items out of 100 each word in a list indicated. The results
are presented in Table VII-19.

A significant study conducted by Mittelstaedt (1971) compared the
results of the Jones and Thurstone (1955), Cliff (1959), and Myers and
Warner (1968) studies. The Jones and Thurstone and the Myers and Warner
studies had 13 stimuli in common. Values for the same 13 stimuli
(treating them as a "scale") were taken from each of the Myers and
Warner groups (housewives, executives, graduate students, and
undergraduates). Product moment correlation coefficients between the
Jones and Thurstone "scale" and the values for each of the Myers and
Warner groups were then calculated. The results are shown in Table
VII-20. Eleven stimuli in Cliff's study were the same as 11 in the
Myers and Warner study. As before, the 11 items for each of Cliff's
study groups were treated as a "scale" and were compared with a "scale"
constructed using the same stimuli for each of the four Myers and
Warner subject groups. The product moment correlation coefficients for
each Cliff group with each Myers and Warner group are presented in
Table VII-21. As may be seen in the tables, Mittelstaedt found a
remarkable correspondence among the scale values from the three studies
in spite of differences in time, place, subjects, instrumentation,
instructions, referents, and context.
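The comparison described above rests on the product moment (Pearson) correlation between two sets of scale values for the same stimuli. The sketch below shows that calculation; the function name `pearson_r` and the two short value lists are hypothetical and are not taken from any of the studies cited.

```python
import math


def pearson_r(x: list[float], y: list[float]) -> float:
    """Product moment correlation between two equal-length lists of
    scale values, one value per shared stimulus."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length lists with at least 2 values")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    dx = [xi - mean_x for xi in x]
    dy = [yi - mean_y for yi in y]
    # Covariance term divided by the product of the two deviation norms.
    numerator = sum(a * b for a, b in zip(dx, dy))
    denominator = math.sqrt(sum(a * a for a in dx) * sum(b * b for b in dy))
    return numerator / denominator


# Hypothetical scale values for the same five stimuli in two studies.
study_a = [1.2, 2.5, 3.1, 4.8, 6.0]
study_b = [0.9, 2.2, 3.5, 4.4, 5.7]

r = pearson_r(study_a, study_b)
print(f"r = {r:.3f}")
```

Because the coefficient is unchanged by a shift or positive rescaling of either list, two studies can correlate highly even when they used scales with different end points, which is why it suits comparisons across instruments.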


TABLE VII-14


Ratings of Likableness, and Likableness Variances for 555 Common
Personality Traits Arranged in Order of Decreasing Likableness

(Anderson, 1968)


Word    L    s²    Word    L    s²

' since. 0 

5Li 

.30 

con.;cicntjnuS 

431 

.82 

1 

.17 

resourceful 

4SI 

.74 

j uiulcrst.u'uiing 

al't 

..52 

alert 

430 

.65 

! iov.u 

517 

.60 

gooil 

430 

.Ob 

' tralliiul 

545 

,61 

witty 

480 

,.S1 

1 trusUv.Hlliy 

55') 

.62 

rti  ir-huadcd 

479 

.69 

1 

537 

,f'2 

k.a.lly 

479 

1.06 

1 (!c|Kfuial})c 

556 

.(.6 

admirable 

478 

.78 

j n;>ca-jr.imli'd 

550 

.,56 

p.tiicm 

473 

.70 

1 LiiHislUiul 

52') 

.47 

l.'.lontcd 

478 

,.S4 

1 \^isc 

.61 

pirccptivc 

477 

.81 

1 c.’nsi<  Urate 

577 

.76 

spirited 

477 

' C‘^'^‘''nalurcU 

527 

.52 

spf'rtsm.infikc 

477 

i.n 

527 

.66 

wi  1-m.inncrcd 

477 

1.05 

J mature 

522 

.66 

co-'pcMtivc 

476 

5’? 

.60 

ctiiicai 

476 

1.15 

521 

.73 

intellectual 

476 

.91 

Ulmi 

520 

.69 

vcri.alilc 

474 

.66 

iricutllv 

519 

.72 

capable 

471 

,63 

kiud-licarlcU 

51 1 

.87 

courageous 

471 

.85 

r.appy 

5M 

.77 

constructive 

468 

.46 

c'.can 

5M 

.99 

productive 

4oS 

.81 

intcrcbtin;; 

511 

.61 

jirogrcssive 

468 

.78 

ur.icldsh 

510 

.65 

individualistic 

40/ 

l.SO 

goou-humored 

507 

.73 

observant 

467 

.81 

lumor  liiic 

507 

.55 

ingenious 

466 

.75 

hunuTCus 

505 

.86 

lively 

466 

.75 

responsible 

505 

.76 

neat 

466 

.93 

cheerful 

501 

.S3 

punctual 

466 

1.26 

truElful 

5iU 

1.07 

logicual 

465 

.76 

ivarm-hcarlcd 

501 

.62 

prompt 

465 

1.16 

brn.id -minded 

505 

.50 

accurate 

■ 461 

.93 

gcr.t’.c 

503 

i.no 

sensible 

461 

.84 

\vcil-?i>‘hcn 

501 

.78 

creative 

4o2 

1.15 

C’iucnlvd 

500 

.73 

self-reliant 

462 

.96 

re.asor.aMe 

500 

.73 

tolerant 

461 

.91 

companionable 

■199 

.88 

amusing 

45)0 

.,39 

likable 

497 

.75 

clcan-cut 

460 

1.49 

trustini; 

497 

1.20 

gencrou.s 

459 

.89 

clever 

496 

.56 

sympathetic 

4.50 

1.05 

p'c  isani 

495 

.,S6 

energetic 

457 

.81 

c Hirtcous 

49( 

.9! 

h:ph'.‘=piritcd 

457 

.7.5 

fiUick-'AiUcd 

•191 

.78 

seif-controlled 

4.^0 

.69 

t.actful 

19! 

.,34 

tcr.-lcr 

456 

1.30 

h''lriful 

4)2 

.74 

active 

4.55 

.65 

appreciative 

402 

.78 

independrne 

455 

1.32 

1 if.iagin.itive 

492 

.96 

respectable 

45,5 

1.10 

cuist.anding 

492 

i.nn 

inventive 

453 

.86 

solf-disciplincd 

■I'll 

.75 

wholesome 

453 

1.14 

brilliant 

4')0 

.96 

conjtrnial 

452 

.82 

cniliisiaMic 

439 

.72 

cordial 

452 

.96 

IcvtMicadcd 

4R9 

.68 

cx[)crienccd 

451 

.76 

poiiic 

4S9 

1.11 

attentive 

4.50 

.84 

ori;;inal 

4SS 

.75 

cultured 

450 

.80 

smart 

453 

.65 

.Tank 

450 

1.10 

lnr,;iving 

4S6 

1.03 

purposeful 

4.30 

.36 

sbarp-witted 

4.36 

1.01 

decent 

4-19 

1.00 

i\cll-rcad 

436 

.67 

diligent 

419 

.82 

ambitious 

484 

1. 14 

realist 

449 

.94 

bi  ight 

4, S3 

.67 

C«gCf 

443 

,)50 

respectful 

4.33 

I.I7 

poised 

44.3 

.78 

eflicicnt 

4S2 

.94 

competent 

447 

.82 

good-lcmiKrcd 

4S2 

1.02 

realistic 

447 

.90 

grateful 

482 

1,00 

amiable 

446 

1.02 


■lit 

1..30 

soft  lie irtcil  1 

.1,87 

I 09 

V'.-  M 'U.-. 

did 

.81 

dignilied 

,180 

1.115 

cnierl.iininf; 

■113 

.03 

»>)}i)<.»S'*phjcal 

380 

1.78 

ndvcnlurous 

411 

.00 

idva'istic 

1.15 

vivacious 

410 

.91 

S')/{  -sDoken 

381) 

1,0.1 

coin;»o‘iod 

410 

.87  1 

disrinlincil 

379 

121 

rc(:i\al  ( 

■),10 

.09  1 

‘cri'ius 

379 

ronuuUic 

4.10 

1.10 

tldi  die 

375 

70 

priMicii'ut 

4.18 

.70 

Ci/nvinring 

.174 

.76 

4.1S 

I ,17 

pvj  su  * >ivc 

371 

92 

skiiiui) 

•l.iS 

.Ml 

fiJiodic/tt 

373 

1.07 

cMvrprisiny 

4,17 

.70 

()u:ck 

373 

1 1.1 

cr.’.cious 

417 

1.01 

Siijdiisiicatcd 

372 

.9.5 

:vl>lo 

4.16 

.08 

llniltv 

372 

,75 

nice 

-^.id 

1. 28 

SvntimcfUal 

3-  1 

1. 10 

n;^'’LC;i!4c  i 

414 

.9,3 

objective 

1 3,0 

1,81 

Sliil'ul  1 

•idd 

..8.3 

nonconforming 

369 

! 1 .33 

Curious 

4.12 

1.13 

rikd'.U'ous  ] 

.100 

1 2,24 

nunicrn 

4.12 

.03 

, nK'slhcmatical  i 

.307 

' 1.01 

cl’.nni'iii" 

AM) 

98 

mcdiialive 

3f*0 

1.52 

social'Ic 

420 

' .85 

fearless  1 

3f'6 

1. 12 

modest 

42S 

1.2.3 

svstenutit  ] 

300 

1.12 

dr'isi\c  ' 

427 

1.03  1 

subtle  1 

3'»5 

1,00 

liuod»!c 

427 

1..H 

n'^rmal  i 

s3o2 

1.21 

u«iv 

427  j 

.82 

darimt  i 

3o0 

1 ,03 

popular 

4?.r, 

,98 

middleciass 

.ViO 

.99 

upright 

420 

I.Ol 

I'jckv 

3.5S 

1 .10 

U:cra?v 

42,4 

1.10 

proud 

3.3S 

1.00 

pr:»c:icM 

425 

.73 

sensitive 

35S  i 

2.00 

liuht'hvnrted 

424 

.99 

moralistic 

3.47 

2.13 

, v.cH-brcd 

421 

1.13 

talkative 

3.12  1 

1..12 

' rciir.cri 

422 

1.10 

1 excited 

351  1 

..80 

i svU  confident 

421 

.81 

1 moderate 

.351  1 

.00 

cool-headed 

420 

.97 

satirical 

.1.11 

1.18 

studious 

41S 

1.00 

! prudent 

34S 

1.71 

vcnlurcsiimc 

417 

.55 

1 reservi'fl 

34S 

1.00 

discreet 

410 

1.20 

persistent 

.147 

1.00 

informal 

410 

1,00 

meticulous 

340 

1.38 

th'iT-ough 

410 

.94 

unconventional 

340 

.92 

cxuilcrant 

414 

.97 

deliberate 

315 

1.40 

in(;uisilivc 

413 

1.17 

i painstaking 

345 

1.44 

ec.'rvtrolng 

412 

1.30 

I'old 

3.10 

1 22 

cut:'oii:g 

1 412 

l.iO 

suave 

3.?5  ! 

I.  .10 

'i.t-suiFicicn! 

412 

1..30 

caul i 'US 

3.11 

.77 

411 

111 

innocent 

332 

1.27 

con<l:lent 

411 

1.01 

inoffensive 

332 

.91 

moral 

411 

1.67 

shrewd 

328 

2.47 

tcjj-nssurcd 

411 

.72 

r.icthociic.al 

325 

1.54 

lintirinj 

410 

nonchalant 

324 

1.2.1 

!:0{>rfiil 

1 400 

.02 

sclf-coutcnted 

321 

2.04 

calm 

400 

.81 

pcrfcclionistic 

322 

1.09 

strong-minded 

101 

1.27 

for\Mird 

,118 

1.12 

[visitive 

•103 

1.28 

cxcitsiblc 

317 

1.15 

conf/ornt 

•lOI 

1.04 

out«jKkcn 

313 

1.77 

a: tistic 

400 

1.58 

prideful 

313 

1.99 

f’rccisc 

400 

1.0.3 

rplirt 

311 

,91 

scicnlific 

•100 

1.05 

impulsive 

307 

1.58 

orderly 

.309 

.84 

ai,'"rrS5ivc 

30 1 

I, -IS 

Social 

30.S 

1,05 

chani;cal)le 

207 

1,08 

direct 

.300 

1.07 

cnnscrvjlivc 

205  ' 

.92 

cr.rrfu) 

3yo 

,84 

shv 

291  1 

.89 

cindid 

389 

1.43 

j licsilarit 

290 

.76 

comical 

3.89 

1,09 

unpredictable 

290 

i.:o 

oiiliKinj; 

389 

1.53 

solemn 

2.S9 

.85 

Self-critical 

389 

1.55 

blunt 

287 

1.03 

fashionable 

387 

1,28 

self-righteuus  | 

2,87 

2.46 

rclipotts 

387 

1.93 

average 

284  j 

.90 


2(t.^ 

3.48 

[ Spendthrift 

221 

.73 

2So 

1,23 

! icmt'cramcntal 

221 

1.10 

unluckv 

2S0 

.52 

giillililc 

219 

.88 

lu-hflil 

279 

.65 

indecisive 

219 

1 .90 

sclf-conccrncH 

270 

1.01 

sillv 

2.9 

1.53 

aull;orit:Uivc 

274 

l.Sl 

sul)missivc 

219 

.90 

274 

] 06 

iipv.ludious 

218 

1 .06 

rc-Nllc^s 

274 

.76 

preoccui)icd 

216 

1.12 

ch»'>o<;v 

272 

1.62 

louse 

215 

.90 

5<  lf-’K»‘?9CSS5C(l  1 

1 '7 ) 

2..53 

fearful 

214 

.69 

naive 

270 

l.Oo 

unronLanlic 

214 

1.33 

<*P|>i»rtunisl 

270 

2.47 

nl'scnt-mindccl 

213 

1,00 

tluMlric.il 

260 

1..50 

ijuprnclical 

213 

1.12 

unsophi>licatcd 

2('i7 

1.23 

wiilidrawn 

213 

.80 

iim>rcs.?ionablc 

2t)() 

.01 

unadventurous 

212 

.93 

oruinary 

206  1 

.n 

sarcastic 

210 

1.30 

strict 

206 

1.30 

sad 

209 

.93 

skcnlicrxl 

264 

1..52 

unemotional 

209 

1.50 

cx(iava::ant 

204 

..88 

u'orr\'ing 

209 

.71 

forceful 

203 

1.65 

lii;;h-strung 

208 

1.57 

cunnii^c 

202 

2.KS 

unoriginal 

207 

.81 

inexi)cricnccd 

202 

.66 

unpoi.scd 

206 

.76 

unincll»o(Jical 

202 

.86 

compulsive 

205 

1.20 

daredevil 

201 

1.23 

worrier 

205 

1.00 

wordv 

261 

1.05 

demanding 

203 

.94 

da\ dreamer 

260 

.05 

utihappy 

203 

.98 

conventional 

260 

.05 

indidcrent 

202 

1 31 

materialistic 

260 

1.66 

uncultured 

201 

1.00 

self-satisfied 

260 

2.00 

clumsy 

199 

,92 

rcbcilious 

258 

1.40 

insecure 

19S 

eccentric 

25- 

I 58 

uncetcrfaffiing 

' 19.5 

.65 

opinionated 

257 

l.OS 

imiiativc 

19.S 

1,17 

stern 

257 

1.10 

melancholy  ' i 

198 

1,13 

If»nc!v 

256 

1.02 

mediocre 

197 

l.IO 

dependent 

254 

1.97 

o!)stinatc  i 

197 

.94 

unsv.y.cnmtic 

253 

.92 

unhealthy 

107 

1.42 

self-conscious 

240 

.92 

hca'istrong 

106 

1.17 

undecided 

240 

.86 

nervous 

105 

.83 

resigned 

24S 

1.22 

nonconfident 

106 

.87 

clo.vnish 

247 

1.73 

stubborn 

196 

1,3! 

anxious 

246 

.90 

unimaginative 

193 

1.06 

conforming 

216 

1.26 

down-henrted 

194 

.97 

critical 

243 

1.46 

unol'servant 

191 

.90 

conformist 

241 

1.15 

inconsistent 

103 

.91 

radi'^al 

241 

l.M) 

unj'unctual 

192 

.96 

(lUsatisfic'l 

230 

1 .65 

unipflustrious 

191 

.81 

oki-hsbioned 

239 

1.39 

tii'turbcd 

ISO 

.97 

mcci^ 

23S 

1.37 

fupcrstilious 

189 

1.33 

frivolous  I 

217 

1 55 

frustrated 

188 

.93 

discontent'‘d 

237 

1.00 

illogical  1 

186 

.97 

troubled 

235 

.71 

rash 

185 

..59 

irrcli'-dous 

23  4 

171 

unenthusiastic 

ISO 

1.0.5 

overcautious 

220 

.55 

inaccurate 

185 

.59 

silent 

22S 

.S3 

noninquisitive 

184 

.90 

tou'^h 

228 

1.74 

unagreeable 

184 

1.08 

ungraceful 

22S 

.87 

jumpv 

183 

.73 

argumentative 

227 

1.25 

jjossc-ssivc 

183 

1.02 

wiili'irav.-ins 

227 

.78 

purposeless 

183 

1.90 

uninquisitive 

225 

.94 

mtjod  V 

182 

1.30 

forcitfiil 

221 

.85 

unenlcrjirising 

1,80 

.81 

inhibited 

224 

.87 

uniuLclIectual 

IN) 

117 

unskilled 

224 

,71 

unwise 

180 

.79 

crafty 

223 

1.98 

- oversensitive 

170 

.77 

passive 

223 

.97 

incflicicnt 

178 

.08 

immodest  i 

222 

I 61 

recklc.ss 

178 

1.42 

unpopidar 

222 

.80  i 

pompc'is 

177 

1.43 

timid  ' 

077  1 

.78  1 

urcongonial 

175 

.59 


inti 

17.S 

.02 

1 tirc'onin 

MO 

1 .70 

un;icc.'m''d.'iting 

I7.J 

.(iS 

dis<>hrtiicnt 

123 

1 1.23 

ir.7 

.s.s 

C'lnpiiining 

127 

.74 

177 

'57 

llOk'ss 

127 

68 

Cvr.ic.ii 

1 171 

1.26 

vain 

127 

.99 

ifv’rv 

in 

.00 

h7.y 

126 

.8.8 

lijtic 

1 i('» 

.72 

unnpjircciatlve 

1 >6 

1 ■•''1 

i 1 

.61 

m;'.l:iilju5lcd 

123 

1 1.07 

ur,i'r.vl!i;;cnt 

1 m 

1 1 07 

aimless 

122 

1 1.16 

tloininccrin^j 

t()7 

l..)2 

boi*itful 

122 

1 .74 

scold' nr 

loo 

.67 

dull 

121 

1 

depressed 

Kjf, 

1.01 

gossipy 

119 

.06 

unoMiKinr 

Ki.S 

.SO 

unipncaling 

110 

l.Ot 

pcrsimistic 

161 

1.06 

li>*T)ochondriac 

ns 

.8.8 

umtlctUivc 

IM 

.74 

irrit.'iting 

118 

1 ■<' 

^oi*itcrous 

161 

1.10 

petty 

1 118 

1 

suspicious 

16.) 

.SS 

shallow 

1 118 

1 1.00 

iii.v.icnfivc 

1 162 

1.13 

dca'[)tivc 

! 117 

1 1. 01 

ON  crcontidcnL 

1 162 

grouchy 

.61 

s:mo' 

1 161 

.68 

cTolis!  ical 

1 116 

1 1.25 

I’.ns'  'ci.i^'lc 

I 161 

1.13 

iVK'ddlc'omc 

116 

1 .62 

mifoduclivc 

1 160 

.0.3 

uncivil 

116 

.06 

w .'.s;  vful 

l«l 

.67 

cold 

113 

.9  * 

i.=;6 

1.13 

unsportsmanlike 

113 

.72 

IK  tllTlfu! 

.50 

hof.sv 

112 

.,89 

i-Urnpcrcd 

I.V) 

ir)pU'.isi'ig 

112 

.71 

hot-hfidcd 

I.IS 

1. 00 

cowardly 

no 

..82 

1 uns 'ciil 

l.'S 

1.16 

discourteous 

no 

.80 

I envious 

1 07 

.77 

incompetent 

no 

.68 

. cNwcrilic.il 

i l.i7 

.S) 

childi'^h 

109 

.,81 

p«'lu!nin^ 

1 116 

1 ..30 

superlicial 

1U9 

.95 

1 150 

1 ..3S 

urrjr  itcful 

1 100 

.71 

1 

1,02 

.SL'lf-rniiu  ilcd 

1 I'l.s 

! 1.14 

foo’.lu'irdy 

i i.vt 

1.00 

i hir<Micarled  | 

K>7 

1.00 

injrr.iturc 

161 

1 X? 

unf-.ir 

107 

1.60 

' don;i''.;rlin<; 

I'i 

i 1.2S 

irres’ionsiblc 

106  I 

1.17 

shenvv  I 

i 111 

1 02 

prejudiced 

106 

1.33 

■ SloMpV 

I ' 

.0.6 

Or.ng.ging 

104 

.72 

' rnsympitlu’f ic  ^ 

1 1 ' ' 

1.32 

jealous 

lot 

. / 1 

uncojiiprotnising  ! 

! l.'.l 

1 1.26 

unpieisant 

104 

.81 

, liot-icnpcrcd 

' I.'^i 

1 .0(5 

' unicliibic 

104 

.93 

1 nnjrotic 

I, '2 

1.31 

im;iolitc 

103 

.72 

1 u-fp'irtin? 

1.i2 

.SO 

crt:dc 

102 

1.29 

1 finickv 

150 

, nosey 

102 

.07 

1 rc=c"tful 

150 

.90 

humorless 

lUl 

.82 

j unru’v 

150 

RS 

1 quarrelsome 

101 

.11 

1 f.-iult-firniins 

MS 

.06 

abusiyc 

100 

.83 

;nc5.iv 

117 

.7,3 

1 tlistui.ilfu! 

1 9') 

1.24 

1 nis'.l 

M7 

1.2.S 

intolerant 

98 

.97 

M6 

.78 

; unforgiving 

93 

.71 

scor.  ful 

14.) 

boi-ing 

97 

.76 

anllsocial 

144 

1.21 

unethical 

97 

.90 

imt  i''lc 

14.1 

.85 

urrcasnnahlc 

97 

.86 

slirr’.' 

M.5 

.60 

sclf-rcntcrcd 

96 

1.13 

t.lCtl.'SS 

142 

.85 

snobbish 

96 

.87 

c.ircic'j 

MO 

.91 

utrlsindlv 

90 

.64 

foolish 

140 

.S3 

ill  mannered 

95 

.76 

tr<’u!'!'  some 

140 

.73 

ill-tcmpcrctl 

95 

.62 

un.’raci  HIS 

140 

.71 

unfriendly 

92 

nc'u'i^^'-nl 

UO 

.6.3 

liostilc 

91 

.77 

wisliN'-wnshv 

I.M 

1.17 

di'^Hkablc 

PO 

.78 

)ir')fanc 

M7 

1 .65 

uilru-aitical 

00 

.98 

;;loanu' 

M6 

84 

oiTcnsivc 

ss 

.83 

hcl|)lc'!> 

M6 

1.12 

belligerent 

86 

.79 

disif;rccahlc 

134 

.6)7 

undcrh.indcd 

*6 

1.19 

touchy 

134 

.83 

annoying 

84 

.66 

irrationuK 

130 

,70 

disrespectful 

83 

.79 

Word             L             s²      Word             L     s²

[illegible]      83            .87     unkind           66    .71
[illegible]      82            .65     untrustworthy    65    .63
[illegible]      80            .58     deceitful        62    .96
[illegible]      [illegible]   1.10    dishonorable     52    .47
[illegible]      [illegible]   .92     malicious        52    .49
[illegible]      78            .88     obnoxious        48    .60
[illegible]      77            .76     untruthful       43    .43
[illegible]      76            .79     dishonest        41    .51
conceited        74            .84     cruel            40    .54
greedy           72            .61     mean             37    .48
spiteful         72            .61     phony            27    .30
insulting        69            .86     liar             26    .36
insincere        66            .65


TABLE VII-15


Means and Standard Deviations for Phrases of
Degrees of Adequacy

(Matthews, Wright, and Yudowitch, 1975)


Phrase                          Mean       SD

Totally adequate                4.6[?]0    .846
Absolutely adequate             4.540      .921
Completely adequate             4.490      .825
Extremely adequate              4.412      .719
Exceptionally adequate          4.380      .869
Entirely adequate               4.340      .863
Wholly adequate                 4.314     1.038
Fully adequate                  4.294      .914
Very very adequate              4.063      .876
Perfectly adequate              3.922     1.026
Highly adequate                 3.843      .606
Most adequate                   3.843      .978
Very adequate                   3.420      .851
Decidedly adequate              3.140     1.536
Considerably adequate           3.020      .874
Quite adequate                  2.980      .979
Largely adequate                2.863      .991
Substantially adequate          2.608     1.030
Reasonably adequate             2.412      .771
Pretty adequate                 2.306      .862
Rather adequate                 1.755      .893
Mildly adequate                 1.571      .670
Somewhat adequate               1.327      .793
Slightly adequate               1.200      .566
Barely adequate                  .627      .928
Neutral                          .000      .000
Borderline                      -.020      .316
Barely inadequate              -1.157      .638
Mildly inadequate              -1.353      .621
Slightly inadequate            -1.380      .772
Somewhat inadequate            -1.882      .732
Rather inadequate              -2.102      .974
Moderately inadequate          -2.157     1.017
Fairly inadequate              -2.216      .800
Pretty inadequate              -2.347      .959
Considerably inadequate        -3.600      .680
Very inadequate                -3.735      .777
Decidedly inadequate           -3.780      .944
Most inadequate                -3.980     1.545
Highly inadequate              -4.196      .741
Very very inadequate           -4.460      .537
Exceptionally inadequate*      -4.560      .637
Extremely inadequate           -4.608      .527
Fully inadequate               -4.667      .676
Exceptionally inadequate       -4.680      .508
Wholly inadequate              -4.784      .498
Entirely inadequate            -4.792      .644
Completely inadequate          -4.800      .529
Absolutely inadequate          -4.880      .431
Totally inadequate             -4.900      .412

Note. * Indicates duplicated phrase. [?] marks a digit that is
illegible in the source copy.


TABLE VII-16


Means and Standard Deviations for Phrases of
Degrees of Acceptability
(Matthews, Wright, and Yudowitch, 1975)


Phrase                          Mean       SD

Wholly acceptable               4.725      .563
Completely acceptable           4.686      .610
Fully acceptable                4.412      .867
Extremely acceptable            4.392      .716
Most acceptable                 4.157      .915
Very very acceptable            4.157      .825
Highly acceptable               4.070      .631
Quite acceptable                3.216      .956
Largely acceptable              3.137      .991
Acceptable                      2.392     1.456
Reasonably acceptable           2.294      .722
Moderately acceptable           2.280      .722
Pretty acceptable               2.000     1.125
Rather acceptable               1.939      .818
Fairly acceptable               1.840      .724
Mildly acceptable*              1.804      .950
Mildly acceptable*              1.686      .700
Somewhat acceptable             1.458     1.241
Barely acceptable               1.078      .518
Slightly acceptable             1.039      .522
Sort of acceptable               .940      .645
Borderline                       .000      .200
Neutral                          .000      .000
Marginal                        -.120      .515
Barely unacceptable            -1.100      .300
Slightly unacceptable          -1.255      .589
Somewhat unacceptable          -1.765      .674
Rather unacceptable            -2.020      .836
Fairly unacceptable            -2.160      .880
Moderately unacceptable        -2.340      .681
Pretty unacceptable            -2.412      .662
Reasonably unacceptable        -2.440      .753
Unacceptable                   -2.667     1.381
Substantially unacceptable     -3.235      .899
Quite unacceptable             -3.388     1.066
Largely unacceptable           -3.392      .818
Considerably unacceptable      -3.440      .779
Notably unacceptable           -3.500     1.044
Decidedly unacceptable         -3.837     1.017
Highly unacceptable*           -4.220      .576
Highly unacceptable*           -4.294      .535
Most unacceptable              -4.420      .724
Very very unacceptable         -4.490      .500
Exceptionally unacceptable     -4.540      .607
Extremely unacceptable         -4.686      .464
Completely unacceptable        -4.900      .361
Entirely unacceptable          -4.900      .361
Wholly unacceptable            -4.922      .269
Absolutely unacceptable        -4.922      .334
Totally unacceptable           -4.941      .235

Note. * Indicates duplicated phrases.


TABLE VII-17


Means and Standard Deviations for Phrases
of Comparison

(Matthews, Wright, and Yudowitch, 1975)


Phrase                          Mean       SD

Best of all                     4.896      .510
Absolutely best                 4.843      .459
Truly best                      4.600      .721
Undoubtedly best                4.369      .823
Decidedly best                  4.373      .839
Best                            4.216     1.459
Absolutely better               4.060      .988
Extremely better                3.922      .882
Substantially best              3.700      .922
Decidedly better                3.412      .933
Conspicuously better            3.059      .802
Moderately better               2.255      .737
Somewhat better                 1.843      .801
Rather better                   1.816      .719
Slightly better                 1.15       .776
Barely better                    .961      .656
Absolutely alike                 .588     1.623
Alike                            .216      .847
The same                         .157      .801
Neutral                          .000      .000
Borderline                      -.061      .314
Marginal                        -.184      .919
Barely worse                   -1.039      .816
Slightly worse                 -1.216      .498
Somewhat worse                 -2.078      .860
Moderately worse               -2.220      .944
Noticeably worse               -2.529     1.036
Worse                          -2.667     1.423
Notably worse                  -3.020     1.038
Largely worse                  -3.216     1.108
Considerably worse             -3.275     1.206
Conspicuously worse            -3.275      .887
Much worse                     -3.286      .808
Substantially worse            -3.460      .899
Decidedly worse                -3.760      .907
Very much worse                -3.941      .752
Absolutely worse               -4.431      .823
Decidedly worst                -4.431      .748
Undoubtedly worst              -4.510      .872
Absolutely worst               -4.686     1.291
Worst of all                   -4.776     1.298


TABLE VII-18


Scale Scores of Statements
Based on Over-All Acceptability
(USATECOM, 1973)

                                                       Standard
Statement                                   Average    Deviation

Excellent                                    6.27       0.54
Perfect in every respect                     6.22       0.86
Extremely good                               5.74       0.81
Very good                                    5.19       0.75
Unusually good                               5.03       0.98
Very good in most respects                   4.62       0.72
Above average                                4.56       0.75
Quite satisfactory                           4.35       0.95
Good                                         4.25       0.90
More than adequate                           4.13       1.11
About average                                3.77       0.85
Satisfactory                                 3.69       0.87
Moderately good                              3.58       0.77
Adequate                                     3.39       0.87
Could use some minor changes                 3.28       1.09
Not good enough for extreme conditions       3.10       1.30
Not good for rough use                       2.72       1.15
Not quite adequate                           2.40       0.85
Not very satisfactory                        2.11       0.76
Barely adequate                              2.10       0.84
Not very good                                2.10       0.85
Below average                                2.03       0.79
Unsatisfactory but usable                    2.00       0.87
Needs major changes                          1.97       1.12
Not adequate                                 1.83       0.98
Barely acceptable                            1.79       0.90
Not good enough for general use              1.76       1.21
Better than nothing                          1.22       1.08
Poor                                         1.06       1.11
Very poor                                    0.76       0.95
Very unsatisfactory                          0.69       1.32
Extremely poor                               0.36       0.76

TABLE VII-19


Meaning of Frequency Words
(Simpson, 1944)

                              75% of Students
                              Thought the Term
                              Meant Less Than This
Term                          Percentage of the Time

Always                              100
Very often                           93
Usually                              90
Often                                85
Generally                            85
Frequently                           80
Rather often                         80
About as often as not                50
Now and then                         35
Sometimes                            35
Occasionally                         33
Once in a while                      27
Not often                            20
Seldom                               18
Usually not                          15
Hardly ever                          13
Very seldom                          10
Rarely                               10
Almost never                          5
Never                                 2



TABLE VII-20


Correlations of
Jones and Thurstone and Myers and Warner
"Scale" Values for 13 Stimuli
(Mittelstaedt, 1971)


Myers-Warner Groups          r

Housewives                 .992
Executives                 .986
Graduates                  .989
Undergraduates             .993

TABLE VII-21


Correlations of
Myers-Warner and Cliff Scale Values
for 11 Stimuli
(Mittelstaedt, 1971)

                                 Cliff Study Groups
Myers-Warner Groups      Wayne State   Princeton   Dartmouth

Housewives                  .990         .990        .987
Executives                  .990         .988        .989
Graduates                   .993         .994        .991
Undergraduates              .996         .99?        .995

TABLE VII-22


Summary of Studies on Perceived Favorableness
of Commonly Used Words and Phrases

                                                   No. of                        No. of
Experimenter            Type of Subjects           Subjects  Type of Words       Words

Mosier (1940,           Psychology students          140     Adjectives           289
1941a, 1941b)

Jones & Thurstone       Army enlisted personnel      905     Adverbs                7
(1955)                                                       Adjectives            51

Myers & Warner          Housewives                    25     Adjectives            50
(1968)                  Business executives           36
                        Graduate business             40
                        administration students
                        Undergraduate business        25
                        administration students

Cliff (1959)            Undergraduate students       537     Adverbs                9

Altemeyer (1970)        College students             586     Adverbs                8

Dodd & Gerberick        Unknown                       40     Adjectives            81
(1960)

Anderson (1968)         College students             100     Personality          555
                                                             traits words

USA TECOM (1973)        Unknown                      Unk.    Adjectives            32

Simpson (1944)          High school and              100     Frequency terms       20
                        college students

Matthews, Wright,       Army enlisted personnel       51     Adjective Phrases    141
& Yudowitch (1975)      and officers



Table VII-22 gives a summary of the studies conducted to show the
perceived favorableness of words and phrases. As can be seen in the table,
a large variety of subjects have been used in the studies. By looking at
the tables presented in this chapter, common words can be found across
studies. Mosier (1941a) showed that the same word gets the same rating
if it is repeated in a list, which implies that words have an inherent
meaning. The view that words have an inherent meaning or perceived
favorableness independent of context and instrument used was supported
by Dodd and Gerberick (1960) and Mittelstaedt (1971).


Chapter  VIII 


CONSIDERATIONS  RELATED  TO  THE  PHYSICAL  CHARACTERISTICS 
OF  QUESTIONNAIRES 


This chapter considers four topics related to the physical
characteristics of questionnaires: the location of the response
alternatives relative to the question stem; questionnaire length; format
considerations such as color, type size, spacing, and numbering; and the
use of answer sheets.


Location  of  Response  Alternatives  Relative  to  Stem 

Only  two  articles  were  found  that  pertained  to  the  location  of 
response  alternatives  relative  to  the  question  stem.  Blumberg,  DeSoto, 
and  Keuthe  (1966)  had  over  100  subjects  rate  well-known  names  on  a variety 
of  traits,  using  a nine  point  scale.  They  concluded  that  untrained  raters 
can  make  relatively  error-free  ratings  without  being  influenced  by  whether 
or  not  the  "good"  end  of  a graphic  rating  scale  was  at  the  left,  right, 
top,  or  bottom. 

The purpose of a study by Madden and Bourdon (1963) was to determine
whether mean job evaluation ratings would differ as a function of seven
variations in rating scale format. One of the variations included printing
responses vertically or horizontally. Sixty basic airmen rated 15
occupations on nine job requirements for each format. It was concluded
that the rating scale format was a determiner of the judgment of the
raters in the sample.


Questionnaire  Length 

This section considers the effects of overall questionnaire instrument
length on response rate, response inconsistency, and validity.
Disagreement was found on the effect of length on the response rate of
mailed questionnaires. Sletto (1940), in a 300 subject pretest of 10, 25,
and 35 page mailed questionnaires, found no significant effect of length
on response rate. Champion and Sear (1969) used 3, 6, and 9 page versions
of the same number of questions spaced so as to affect apparent length.
They mailed the questionnaires to 802 subjects, and their results
contradicted Sletto's findings since they obtained a greater response
rate with the longer questionnaire. However, the overall response rate
was only 35%.

Three other investigations concluded that the response rate for
mailed questionnaires is greater for shorter questionnaires. Leslie
(1970) concluded (without reporting data, however) that one or two page
questionnaires improve the response rate for mailed questionnaires. Ford
(1968) found a slightly increased (but nonsignificant) response rate in
a 1,656 subject test of the use of a printed, folder-type questionnaire, as
compared with a larger appearing mimeographed, stapled format. One versus two
page mailed questionnaires were tested by Bauer and Meissner (1963).
They found that, in going from the one page to the two page format,
nonresponse increased from negligible to over 5%. They also found that
absolute correctness of responses dropped from 53.5% to 47%, and nonsense
answers increased from 1.5% to 5%. Their report, however, gave
insufficient information to allow the reader to check the conclusions.

The effect of instrument length (in terms of total number of items)
and other characteristics on response inconsistency was studied by Ace
and Davis (1972). Using 177 college sophomores, they found that response
inconsistency was only somewhat influenced by length and format, but
considerably influenced by the type of scoring.

There have been a number of studies on the effect of instrument
length on validity, but since they were concerned with cognitive and
achievement tests, they were outside the main scope of this review. For
example, Brokaw (1951), using six tests administered to 223 Air Force
basic airmen to class them for training in technical specialties, found
composite validity against course grades was .56 for half-length tests,
.57 for full-length tests. Battery reliability of the half-length tests
was .90, compared with .95 for the full-length tests. Since the tests
measured reasoning and knowledge of facts, the results may not be
generalizable to questionnaires as defined for this review.
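The reliabilities reported by Brokaw can be compared with what the
Spearman-Brown formula would predict for a doubled test length. That
formula is a standard psychometric result and is not cited in the review
itself; the sketch below is illustrative only. The function name is
hypothetical, and the only values taken from the text are the reported
reliabilities of .90 and .95.

```python
def spearman_brown(reliability: float, length_factor: float) -> float:
    """Predicted reliability when a test is lengthened by length_factor.

    Standard Spearman-Brown prophecy formula (an assumption of this
    sketch, not a method described in the review).
    """
    return (length_factor * reliability) / (1 + (length_factor - 1) * reliability)


# Reliability reported in the text for the half-length battery.
half_length_reliability = 0.90

# Doubling the half-length battery gives the full-length battery.
predicted_full_length = spearman_brown(half_length_reliability, 2)
print(f"Predicted full-length reliability: {predicted_full_length:.3f}")
```

Under these assumptions the predicted value is close to the .95 reported
for the full-length tests, which is consistent with the small gain that
Brokaw observed from doubling the length.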

In another study, Appel (1959) compared true-false and forced choice
questionnaires, each administered to about 400 college students. He
concluded that for longer forms the forced choice method is likely to
result in greater validity, while for shorter forms the true-false method
is likely to prove superior.

In  conclusion,  disagreement  was  found  on  the  effect  of  length  on 
response  rate  to  mailed  questionnaires,  little  information  was  found  on 
the  effect  of  length  on  response  consistency,  and  nothing  was  found  relating 
length  to  the  validity  of  questionnaires  as  defined  for  this  review. 


Questionnaire  Format  Considerations 

Little specific information was found related to questionnaire format
considerations such as type size, spacing, color, etc. Sletto (1940) had
47 students rate the esthetic appearance of 10 different questionnaire
formats, and found that preferences were not highly individualistic nor
erratic. Wolfe (1956) discussed the effects of layout appearance,
arrangement of questions and responses, and instructions. He noted
differences, but provided no empirical data. Finally, Lehman (1967)
reported that varying the length of a rating scale line from three and
one-half to seven inches appeared unimportant in similarity ratings.




The Use of Answer Sheets


Several  articles  were  located  regarding  the  use  of  answer  sheets, 
although  this  topic  was  not  stressed  in  the  literature  review.  Dunlap 
(1940)  tested  serially  numbered,  repetitively  numbered,  articulated, 
and  unarticulated  answer  sheets  in  all  combinations,  using  20  groups  of 
fourth  and  eighth  graders.  The  sizes  of  the  groups  ranged  from  251  to 
364.  His  major  conclusions  were: 

1. Marking articulated, repetitively numbered separate answer
sheets is equally as satisfactory as underlining the correct
response.

2.  There  is  evidence  that  repetitive  numbering  results  in  more 
errors  than  serial  numbering. 

3.  The  use  of  articulated,  serially  numbered  answer  sheets  is 
entirely  satisfactory  when  compared  with  the  results  in  using  the 
underlining  method. 

4.  The  use  of  unarticulated  but  serially  numbered  answer  sheets 
also  seems  justified.  There  was,  however,  a slight  difference  in 
results  favoring  articulated,  serially  numbered  answer  sheets. 

5. Unarticulated, repetitively numbered answer sheets are somewhat
less satisfactory substitutes for the underlining type of test
than serially numbered, articulated sheets.

6.  There  is  no  evidence  that  the  separate  answer  sheet  cannot  be 
used  with  children  in  grade  levels  at  least  as  low  as  the  fourth. 

7. There is no evidence to support the contention that in a multiple
choice test there is a psychological advantage in having the response
indicated as close in time and space as possible (i.e., by underlining)
to the decision as to the correct answer.

8.  In  summary,  other  things  being  equal,  the  use  of  articulated, 
serially  numbered  answer  sheets  is  recommended,  particularly  if  the 
test  is  short  enough  to  enable  all  answers  to  be  recorded  on  a single 
side  of  the  sheet. 

In  a similar  study,  Faerber  (1951)  tested  230  students,  finding  a 
multiple  choice  test  with  a separate  answer  sheet  more  difficult  than 
open  answer,  right/wrong,  or  multiple  choice  without  a separate  answer 
sheet  when  the  tests  were  timed.  When  the  effects  of  time  were  removed, 
the  machine  scored  forms  (all  but  the  open  answer)  were  more  difficult  than 
the  open  answer  form.  A different  set  of  abilities  for  answering  machine 
scored  tests  was  hypothesized. 




Bell, Hobb, and Hoyt (1964) compared a standard two page "fill in the
mark" machine scored answer sheet with a new condensed one page answer
sheet. For 1,048 civilian employees, the condensed sheet produced
significantly lower scores, leading the authors to attribute the
difference to the decreased type size. They concluded, however, that
measures can be taken to compensate for the change. In a related
experiment with 482 subjects using cross-out instead of fill-in answering
on the condensed sheet, no significant differences were found between the
one and two page answer sheets. The authors did not examine difference in
subject familiarity with the two forms.

A comparison of answer sheets was also made by Dizney, Merrifield, and
Davis (1966). Using an arithmetic test, they found that, in response to
each of three questions, proportionally more students using the IBM 1230
format reported difficulty in using the answer sheet than did those
students using the older IBM 305 format, although the answer sheets were
similar. However, a statistical test of the scores of those reporting
difficulty using the two formats indicated no significant differences.

In order to investigate the age range over which separate answer
sheets could be used, Solomon (1971) tested 116 inner city fourth graders
with three different answer formats for a reading test: answers within
the booklet; separate hand scorable answer sheets; and separate machine
scorable answer sheets. No statistically significant differences were
found. For an older age group, Hart, Faust, Rowland, and Lucier (1964)
recommended the use of optical scan and reusable booklets with graduated
pages whenever possible. Their report, using a sample size of 2,160,
was on a study of the attitudes of troops in the tropics.

In a study of problems related to the use of answer sheets, Swordes
(1952) found that respondents frequently erred in using the last space
on a multiple choice answer form when there were more spaces than actual
choices. Precautions should, therefore, be taken, such as using the same
number of distractors.

Although  the  studies  reported  above  had  to  do  with  the  use  of  answer 
sheets  with  achievement  tests,  the  results  would  appear  generalizable  to 
the  construction  of  questionnaires. 


Chapter  IX 


CONSIDERATIONS RELATED TO THE ADMINISTRATION
OF QUESTIONNAIRES


Considerations related to the administration of questionnaires are
considered in this chapter since such matters are obviously of concern
when questionnaires are constructed. The effects of instructions upon
questionnaire results are first discussed, followed by sections on the
effects of: various motivational factors; anonymity; administration time;
characteristics of questionnaire administrators; administration
conditions; and other factors such as bias and halo.


Effects of Instructions


Several studies discussed the amount of variance in responses due to
variations in giving instructions. Some of the variance in instructions
is unintentional, as was indicated in a study conducted by Belson
(undated, a). In that study 236 tape recorded interviews were conducted, in
which respondents were asked to use the semantic differential scaling system.
The interviewers were told to deliver the printed instructions word for
word. Analysis of the tape transcripts showed only 2% of the instructions
were delivered word for word. Deviations from the instructions took the
following forms: total phrases were eliminated with considerable ad
libbing; and key words intended to focus the respondent's attention on
some specific part of the instructions were frequently omitted or changed.
The deliveries were rated for accuracy in presenting the 34 basic ideas in
the instructions. In the average delivery, 28% of the key ideas were lost,
mainly through omission. Interviewer performance varied substantially
both across interviewers and within individuals.

Madow  (1965)  stated  that  the  interviewer's  attitude  toward  the  question 
communicates  itself  sufficiently  to  the  respondent  to  alter  the  meaning  of 
the  question.  He  concluded  that  the  nature  of  the  survey  and  the  survey 
organization  are  determining  factors  in  whether  or  not  the  interviewer  must 
follow  the  interviewer  schedule  verbatim  or  may  vary  the  wording. 

Instructions  are  often  varied  in  experiments  to  induce  response  sets. 
These  experiments  usually  use  standard  instructions  and  instructions  to 
fake.  In  a study  by  Winters  and  Bartlett  (1966),  a forced  choice  scale 
was  constructed  to  provide  independent  measures  of  two  types  of  response 
tendency,  acquiescence  and  social  desirability.  The  scale  was  administered 
under standard and faking instructions. Factor analysis yielded a social
desirability factor under each instructional set, and an acquiescence
factor only under standard instructions. Social desirability scores were
observed to be orthogonal between instructional conditions. In another
study conducted by Bartlett and Doorly (1967) using a forced choice scale
measuring social desirability, the authors found that different
instructional sets do affect the tendency to answer in a socially
desirable way.

Lederman (1971) administered two formats of the Thorndike Dimensions of
Temperament to college students under regular directions and under
instructions to give socially desirable responses. He found that the
forced choice format produced low scale intercorrelations under regular
directions, but under the desirability directions a common factor
appeared. He also found that the same factor appeared in the questionnaire
format under both regular and desirability directions. French (1958)
found that significant differences were observed in most of the scales of
the Edwards Personal Preference Schedule under different instructions.

Rambo (1968) found, using the Hinckley Scale of Attitude toward
Negroes and a Likert scale, that the magnitude of the linear association
between the scales was influenced by the instructions presented to the
subjects. Frederiksen and Messick (1958) found that instructions
altered mean criticalness set scores in the expected direction to an extent
that it was significant on one of the three tests used, nearly significant
on another, and nonsignificant on the third. And Bloxom (1968) found that
mildly anger arousing printed instructions when compared with non-anger
arousing instructions elicited more responses of negative self-re[illegible]

Jarrett and Sherriffs (1956) concluded that telling people to answer
every item on a questionnaire or to omit an item if there is clearly no
difference does not yield different results. Miron (1961) found that
urging subjects to answer a retest of a questionnaire the same way they had
answered the original questionnaire was superior to nonrecall conditions
with respect to mean absolute test-retest deviations.

Berger and Sullivan (1970) examined an hypothesis that instructions
emphasizing a respondent's importance in an attitude survey would result
in a reduced number of "don't know" responses to the items. A 20 item
questionnaire was administered to 180 undergraduates under three contexts:
face-to-face interviews; telephone interviews; and group administration.
Contrary to the hypothesis, the telephone interviews and group
administration contexts yielded significantly more "don't knows" under the
instructions emphasizing the respondent's importance than under the
control instructions. There was no difference between instructional sets
in the face-to-face context.

From the above discussion it appears that instructions do affect the
responses collected by questionnaires. It also appears that more
systematic research is needed to determine the range of variations in
instructions that may affect the results given on questionnaires, and the
effects of variations in respondent understanding of instructions.


Effects of Various Motivational Factors


In this section, the effects of various motivational factors are
considered. The effects of a lack of respondent motivation will first be
briefly considered. Attention will then be given to factors that affect
the rate of return of questionnaires. Respondent preferences for certain
item formats will next be reviewed, followed by a discussion of the effects
of the behavior of the administrator on questionnaire response.




Effects of lack of respondent motivation. Some of the studies dealing
with motivation discuss the effect of lack of motivation on responses to
items. Flanagan (195?) tested the effects of motivation in a study of two
groups. One group was reported as having high motivation: a group of Air
Force Aviation cadets taking the Aircrew Job Elements Aptitude Tests. The
other group was reported as having low motivation and consisted of seniors
in their last two weeks of school or students involved in five days of
testing. The incidence of patterned responses or random responses was
higher in those groups where the motivation was suspect.

Hart, Faust, Rowland, and Lucier (1964) found that respondents who
made three or more consistent errors in a study to assess the attitudes
of troops in the tropics were more negative in attitude. Levy-Leboyer
(1955) similarly found that omissions on achievement and intelligence tests
were more due to motivation at the moment than to any persisting
psychological trait. Finally Kendall (1954), in a study of factors which
seemed to contribute to unstable responses people make to attitude
questionnaires, found that shifts in the mood of the respondents and the
degree of interest in or concern with the questions posed, affected
results.

Factors affecting the rate of return of questionnaires. Some of the
factors that affect the rate of return of questionnaires are reviewed below.
Included are: the effect of ego involving the subject in the study; the
use of advance letters, cover letters, and other techniques to stimulate
the return of mailed questionnaires; and other factors.

Three studies examined the effect of ego involving the subject in the
study. Slocum, Empey, and Swanson (1956) found that efforts to establish
an image of the [illegible] utility of a survey and to emphasize the special
role of each respondent maximized the responses to a questionnaire and
structured interview. Sletto (1940) found that three different cover
letters did not significantly affect the rate of return of mailed
questionnaires. One cover letter requested help on an altruistic basis,
another on a challenge basis, and a third on a "help us" basis. In contrast
to the Sletto (1940) study, Champion and Sear (1969) found that egoistic
cover letters produced greater response rates than altruistic. This was
found true especially in the case of lower class respondents. Cahalan
(1951) conducted a study in which a questionnaire dealing with Army
interests was mailed to 1,051 Army officers. The author felt the 84% return
rate was due to the interaction of the institutional control applied by the
Army, or the traditional responsibility of Army officers, with the wording
of the cover letter which stressed responsibility and requested a return
within five days.

In examinations of the effect of letters sent in advance of the
questionnaire on response rates, Ford (1967) found that such letters
significantly improved response rates. Myers and Hang (1967) also found
that an initial letter increased response rates significantly, but Brunner
and Carroll (1969) found that the initial letter did not significantly
reduce interview refusal rate.


Glickman (1962) concluded that repeated administration did not have
an adverse effect on the proportion of subjects returning questionnaires.
Durant and Maas (1956) also found that people previously approached
responded more readily a second time.

Several studies have been conducted to determine what techniques are
useful to stimulate the return of mailed questionnaires. One technique
involved including a stamped and addressed return envelope with the
questionnaire. Ferris (1951) determined that including the stamped and
addressed return envelope increased the response rate 53%. Clausen and Ford
(1947), however, concluded that including prepaid return envelopes did not
influence the return of questionnaires.

Another technique that was studied was the followup with reminders to
complete questionnaires. Myers and Hang (1967) got a response rate of 28%
for a group that had a followup letter mailed one week after the mailing
of a questionnaire, and a 28.9% response rate from a group treated
similarly but with no followup. Clausen and Ford (1947) also concluded that
followup had no effect on response rate. Ferris (1951) similarly concluded
that prodding with reminding postcards does not increase response rate. But
Watson (1965) did find that followup postcards increased response rates
from 30% to 40%, and two day followup mailing to the entire sample raised
the response rate from 30% to 46%. Leslie (1970) recommended, without data
to support it, the use of second, third, and fourth mailings and followup
with personal calls to improve response rates.
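Whether differences such as these are statistically reliable depends on
group sizes, which the review does not report. The following sketch shows
one conventional way to compare two response rates, a pooled two-proportion
z test. The test itself is an assumption of this example and is not
attributed to any of the cited authors. The group size of 200 is purely
hypothetical, as are the function and variable names; only the percentages
come from the text.

```python
import math


def two_proportion_z(p1: float, n1: int, p2: float, n2: int):
    """Pooled two-proportion z test; returns (z, two-sided p value)."""
    pooled = (p1 * n1 + p2 * n2) / (n1 + n2)
    standard_error = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (p2 - p1) / standard_error
    # Two-sided p value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# HYPOTHETICAL group size; the review does not give the actual figures.
GROUP_SIZE = 200

# Watson (1965): 30% without and 40% with followup postcards.
z_watson, p_watson = two_proportion_z(0.30, GROUP_SIZE, 0.40, GROUP_SIZE)

# Myers and Hang (1967): 28.9% without and 28% with a followup letter.
z_myers, p_myers = two_proportion_z(0.289, GROUP_SIZE, 0.28, GROUP_SIZE)

print(f"Watson: z = {z_watson:.2f}, p = {p_watson:.3f}")
print(f"Myers and Hang: z = {z_myers:.2f}, p = {p_myers:.3f}")
```

With the assumed group size, a ten point gain would be distinguishable from
chance while a difference of under one point would not. Different group
sizes could change either conclusion.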

A third technique which was studied involved the type of postage used
to send the questionnaire. Watson (1965) obtained the same response rate
using air mail and third class mailing. Clausen and Ford (1947) concluded
that special delivery did not influence response rate. Champion and Sear
(1969) came to the same conclusion.

The effect of incentives on return rates of mailed questionnaires has
been studied by Brennan (1958). He tested the hypothesis that trading
stamps would be an effective incentive to improve the response return
rates in mail surveys. A questionnaire was sent out without incentives,
with 50 trading stamps included, or with the promise of 100 trading stamps
or 25¢ upon return of the completed questionnaire. The results showed no
significant differences: the average return rate under all three conditions
was approximately 27%. Watson (1965), however, received a return rate of
40% for a 10¢ incentive, of 41% when a packet of stamps was used, and of
48% when a 25¢ incentive was used.

Ferris (1951) found that responses were most frequently mailed on
Thursday and Friday, least frequently on Saturday and Sunday. In turn,
Leslie (1970) suggested mailing questionnaires so that they arrive on
Thursday or Friday.

Simon (1967) investigated the effect of personally typed cover letters
versus mimeographed form letters on response rate in mail surveys. He
found that there was no overall clearcut advantage for personally typed
cover letters in terms of response rate. He also noted the possibility
that in some cases personally typed cover letters may reflect lack of
anonymity and may therefore decrease response.

Ford (1968) demonstrated that printed, folder-type questionnaires
generated higher responses than mimeographed, stapled questionnaires.
This supports Leslie (1970) in the suggestion that long questionnaires be
printed, not mimeographed, because printing reduces the length by
two-thirds. However, Durant and Maas (1956) found that having to fill in
two questions did not greatly increase cooperative response over having to
fill in a 53 item questionnaire, which suggests that length may not be
related to motivation for returning a questionnaire.


Respondent  preferences  for  certain  item  formats.  Several  studies 
demonstrated  subjects'  preferences  for  specific  types  of  item  formats.  In 
some  cases  this  preference  did  not  seem  to  have  an  effect  on  the  results. 
Steinbeck  (1972)  found  that  a format  in  which  subjects  rated  themselves  on 
given  items  on  a nine  point  scale  was  more  acceptable  than  a format  in 
which  they  had  to  select  items  most  or  least  like  themselves.  Zavala  (1965) 
found  that  raters  prefer  forced  choice  formats  using  four  favorable  items 
from  which  they  choose  the  items  most  characteristic  of  the  person  rated. 
Waters  and  Wherry  (1961a)  determined  that  subjects  were  more  favorable 
toward  a response  format  allowing  them  to  indicate  the  degree  of  applica- 
bility of  each  statement  in  the  forced  choice  pairs  than  they  were  towards 
other  forced  choice  formats.  Waters  (1966)  also  reported  that  a subject's 
reaction  to  a forced  choice  scale  is  more  favorable  when  some  method  is 
incorporated  whereby  he  is  given  the  opportunity  to  indicate  the  degree  of 
applicability  of  each  item  to  himself.  Gaito  (1962)  speculated  that  forced 
sort  Q-sorting  techniques  may  adversely  influence  a subject's  spontaneity. 
Turgut  (1963)  showed  that  57%  of  a group's  subjects  preferred  the  format 
of the Edwards Personal Preference Schedule, and 32% liked the Q-sort format.
Jones  (1968)  showed  that  subjects  clearly  preferred  multiple  category 
options  over  two  category  options.  Subjects  also  reported  that  multiple 
choice  and  true-false  continuums  were  more  interesting  than  the  dichoto- 
mous true-false  format.  Hughes  (1967)  reported  that  a check  list  was  pre- 
ferred over  the  semantic  differential  when  both  were  first  administered  to 
subjects,  but  preference  for  the  semantic  differential  increased  from  11% 
to 34% in a retest situation while the preference for the check list
declined from 57% to 40%. Hughes attributed the increased preference for
semantic  differential  to  the  respondents  becoming  more  familiar  with  it. 
Matell (1970) suggested that, in constructing Likert-type scales, the
number of steps should be chosen according to the respondents' preference.


Effects of the behavior of the administrator on response. Several
studies concluded that reinforcing behaviors of the interviewer, test
administrator, or data collector have an influence on the responses
collected. The effect of the experimenter's influence was studied by measuring
the responses first graders from middle and working class families made
under  the  conditions  of  reinforcing,  neutral,  and  non-supporting  atmos- 
pheres induced  by  the  experimenter  (Sgan,  1967).  Results  indicated  that 
middle  class  children  were  more  susceptible  to  the  experimenter's  influ- 
ence and  that,  under  a reinforcing  atmosphere,  they  were  significantly 
more  apt  to  change  their  preferences.  Stember  and  Hyman  (1949)  conclud- 
ed that  interested  respondents  were  more  subject  to  interviewer  effects 
than  uninterested  ones.  Wickes'  (1956)  findings  suggested  that  such 
comments  as  "good"  or  "fine"  and  such  actions  as  smiling  and  nodding  by 
examiners  have  a decided  effect  upon  test  results. 

Marquis,  Marshall,  and  Oskamp  (undated)  reported  that,  although  re- 
spondents liked  the  interview  more  when  the  interviewer  was  supportive, 
his  manner  had  no  effect  on  the  accuracy  or  completeness  of  the  responses. 
However, Marquis, Cannell, and Laurent (1972) reported that the interviewer's
use of reinforcement increased the accuracy of reports from respondents
who had not completed high school, and had the opposite effect on
those  who  had.  Field  (1955)  found  that  praised  respondents  in  public  opin- 
ion interviewing  situations  tended  significantly  to  offer  more  answers 
than  the  unpraised  ones.  Praising  respondents  tended  to  reduce  "don't 
know"  answers,  but  praising  did  not  increase  insincere  or  dishonest  responses. 

In a study conducted by Hildum and Brown (1956), it was found that
"good" proved to bias results in a phone attitude survey while "mm-hmm" did
not. Matarazzo, et al. (1964) reported a 31% increase in subjects' average
duration of single utterances when the interviewer said "mm-hmm" all the time the
subject  was  talking.  In  a cross  validation  study  there  was  an  8U%  increase 
in  the  mean  duration  of  single  units  of  interviewee  speech.  Dixon  (1970) 
used  subjects  who  were  high  or  low  on  a social  desirability  scale  in  an 
experiment using reinforcement to increase self-referent statements. The
reinforcement  was  done  by  the  interviewer  saying  "good"  after  every  sentence 
using  "I"  or  "We."  High  social  desirability  subjects  responded  to  reinforce- 
ment by  increasing  equally  the  frequency  of  both  positive  and  negative  self- 
referent statements.  Low  social  desirability  subjects  did  not  condition, 
but continued to make more positive than negative self-references.


Effects  of  Anonymity 

Several  studies  have  been  carried  out  to  determine  the  effect  of 
anonymity  on  questionnaire  responses.  Pearlin  (1961)  carried  out  a study 
in which he found that people selecting anonymity in filling out a questionnaire
had different characteristics than those who did not. A questionnaire
was administered to the nursing force of a large Federal mental
hospital.  Respondents  were  given  the  option  of  anonymity.  It  was  found 
that  those  selecting  anonymity  were  no  more  negative  in  their  attitudes 
on a number of critical issues than were those who signed their questionnaires.
Anonymous respondents were more subject to feelings of incompetence
as  reflected  by  their  low  scores  on  a measure  of  self-regard,  by  their 
reluctance  to  voice  opinions  at  work,  and  by  their  reported  difficulty  in 
coping  with  the  questionnaire.  A second  distinguishing  characteristic  of 
the  anonymous  respondents  was  their  generally  cautious  view  of  people  about 
them and their motives. Finally, it was shown that the anonymous respondents
had less involvement and interest than signers in the issues covered
by  the  questionnaire.  Based  on  these  findings,  Pearlin  suggested  that 
anonymity  in  administration  is  useful,  but  for  reasons  other  than  the  pre- 
vention of  the  arousal  of  fear  or  threat. 

A few  studies  provide  evidence  that  anonymity  is  affected  by  more 
than signing or not signing a questionnaire. Wiseman (1972) noted that
questionnaires  provide  more  anonymity  than  interviews.  Metzner  and  Mann 
(1952)  conducted  a study  in  which  a fixed  alternative  questionnaire  was 
compared  to  an  open-ended  interview  with  328  employees  in  an  electric 
utility  plant.  The  subjects  were  given  an  attitudinal  questionnaire  about 
their  job  with  five  scaled  responses  to  choose  from,  followed  two  months 
later  with  a personal  interview  asking  similar  questions.  The  respondent's 
anonymity  was  assured  both  times.  Blue  collar  workers  were  more  confident 
of  the  anonymity  of  the  questionnaires,  while  white  collar  workers  felt 
the  interviews  were  not  less  anonymous  than  the  questionnaires.  In 
general,  the  interviews  yielded  higher  proportions  of  satisfied  responses 
than  the  questionnaires. 

Knudsen,  Pope,  and  Irish  (1967)  collected  data  from  samples  of  pre- 
maritally  pregnant  white  women  by  three  methods.  The  first  sample  anony- 
mously completed  questionnaires  in  their  physician's  office,  the  second 
sample  was  interviewed  confidentially,  and  the  third  filled  out  question- 
naires in  the  presence  of  an  interviewer.  Data  suggested  that  in  the  inter- 
view situations  the  respondent  was  more  likely  to  support  the  public  and 
restrictive  sexual  norms  that  she  assumed  were  adhered  to  by  the  interview- 
er. Lower  socioeconomic  respondents  deferred  to  the  norms  represented  by 
the  higher  status  interviewers.  In  the  private  and  anonymous  questionnaire 
situation,  the  respondents  more  often  answered  to  subcultural  norms. 

Simon (1967), in his article on personally typed cover letters versus
mimeographed  form  cover  letters  in  mail  surveys,  advanced  the  possibility 
that  in  some  cases  the  personally  typed  cover  letters  may  reflect  a lack 
of  anonymity  to  respondents  even  though  it  is  assured  because  the  letters 
are  addressed  to  them  personally. 

Hamel and Reif (1952) explored the question of differences in response
due  to  signing  or  not  signing  the  Employee  Attitude  Questionnaire.  They 
found  that  essentially  the  same  responses  were  obtained  for  individuals  in 
identified  or  anonymous  groups.  They  speculated,  however,  that  these 
results  may  have  been  influenced  by  the  fact  that  the  staff  of  a university 
organization  administered  the  questionnaires  and  respondents  were  repeat- 
edly assured  that  the  questionnaire  would  only  be  used  for  confidential 
research  purposes. 

Dunnette  and  Heneman  (1956)  investigated  the  effects  on  attitude 
responses  of  the  identity  of  the  survey  administrator.  Two  employee 
samples  were  selected  randomly  from  the  total  work  force  of  a large  de- 
partment store.  The  IRC  Employee  Attitude  Questionnaire  was  administered 
to  one  group  by  an  Industrial  Relations  Center  staff  member  and  to  the 
second  group  by  the  personnel  manager  of  the  store.  The  group  which  was 
given  the  questionnaire  by  the  manager  responded  more  favorably  to  the 
attitude  survey  than  the  other  group.  The  same  group  tended  to  give  few- 
er and  shorter  responses  to  open-end  questions  than  the  employees  who  were 
given  the  questionnaire  by  the  Industrial  Relations  Center  staff  member. 


Klein, Maher, and Dunnington (1967) compared attitude survey responses
between identified and nonidentified manufacturing employees made under
two conditions of identification. One condition involved a face-to-face
designation by the respondent's manager as to which group he was to be in
(high threat), and the other involved a random allocation as the respondent
entered the testing room (low threat). All subjects were assured of the
confidentiality of their responses, and the nonidentified respondents were
assured anonymity. A positive distortion in responses took place under both
identified conditions, but significantly more under high threat.

Bartlett and Sharon (1969) determined the effects of several instructional
rating conditions on leniency on a graphic and forced choice rating scale.
Approximately 1,000 undergraduate psychology students rated their instructors
under instructional conditions indicating that the ratings: will be anonymous
and will be used for research purposes only; may be used for evaluation
purposes; will be identified by having the rater place his name on the rating
form; or will have to be explained to the ratee by the rater. A significant
leniency effect was found with the graphic ratings which were to be used for
evaluation purposes and those that had to be justified to the ratee. It was
concluded that the forced choice scale was quite resistant to leniency bias,
however.

Some  investigators  found  no  differences  in  anonymous  versus  non- 
anonymous  conditions.  Edwards  (1957a)  found  that  assurance  of  anonymity 
did  not  eliminate  or  drastically  change  the  nature  of  the  relationship  pre- 
viously found  between  probability  of  endorsement  and  social  desirability 
scale  value  where  the  assessments  were  not  made  anonymously.  Ash  and 
Abramson  (1952)  concluded  that  the  verbally  expressed  attitudes  of  college 
students,  as  recorded  on  scales  relating  to  ethnocentrism,  political- 
economic  conservatism,  and  anti-Negro  prejudice,  were  not  biased  in  either 
a more  'pro'  or  more  'anti'  direction  as  a result  of  the  requirement  that 
they sign the scales, thus identifying themselves. Gerberich and Mason
(1948)  found,  in  the  administration  of  a questionnaire  on  academic  back- 
ground, plans,  and  study  habits  to  2,876  students  taking  a biological 
science  course,  that  there  were  no  significant  differences  between  signed 
and  unsigned  questionnaires.  As  mentioned  above,  Hamel  and  Reif  (1952) 
also  found  no  differences  in  signed  versus  unsigned  questionnaires.  But 
Corey  (1937)  found,  in  the  administration  of  a questionnaire  on  cheating, 
that mean scores reflected a slight but statistically insignificant
tendency  for  more  sympathetic  attitudes  toward  cheating  to  be  expressed 
on  anonymous  papers. 

Some  studies  have  indicated  that  different  results  were  obtained  in 
anonymous  versus  nonanonymous  situations.  Fischer  (1946)  gave  a psycho- 
logical problem  checklist  to  102  female  psychology  students,  first  with 
signatures  required,  then  a week  later  without  signatures  required.  The 
results  indicated  that  the  mean  number  of  problems  listed  did  not  vary 
significantly  under  the  two  conditions,  but  the  mean  number  of  serious 
problems  listed  tended  to  be  significantly  greater  when  signatures  were 
not  required.  In  a study  conducted  by  Olson  (1936)  a personality  test  to 
measure emotional instability was given to two comparable groups of
college  women,  one  group  remaining  anonymous,  the  other  group  signing 
their  names.  Subjects  reported  significantly  more  feelings  and  symptoms 
with  neurotic  implications  under  anonymous  conditions  than  when  required 
to  sign  their  names. 

The effects of anonymous versus nonanonymous data collection seem
to  be  related  to  the  content  of  the  data.  Wiseman  (1972)  conducted  a 
public opinion poll on current social issues using three samples of
Boston  households  controlled  for  socioeconomic  factors.  Data  were  col- 
lected from  one  sample  by  mailed  questionnaires,  from  the  second  by 
telephone  interview,  and  from  the  third  by  personal  interview.  For  eight 
of  the  ten  questions,  the  results  under  the  three  conditions  were 
similar;  but  for  two  questions,  one  on  contraception  and  one  on  abortion, 
the  results  differed  significantly.  On  the  anonymous  questionnaire  more 
people seemed to be in favor of such programs than in either the telephone
or  personal  interview.  Wiseman  concluded  that  sensitive  issues  involving 
socially  accepted  or  rejected  answers  will  effect  more  response  bias  in 
interviews  than  in  questionnaires. 

In  the  Klein,  Maher,  and  Dunnington  (1967)  study  described  above, 
items  themselves  produced  variable  distortion.  Items  dealing  with  salary 
and  with  ratings  of  top  management  produced  consistent  positive  distortions 
under  identified  conditions,  whereas  items  dealing  with  work  pressure  and 
the  subject's  manager  produced  little  or  no  distortion.  Dunnette  and 
Heneman  (1956)  also  found  that  the  amount  of  response  distortion  depended 
upon the content of the items comprising the questionnaires.

Rosen  (1960),  as  a result  of  a study  with  college  freshmen  completing 
signed  or  unsigned  questionnaires  on  the  effectiveness  of  a reading  program, 
concluded  that,  when  respondent  identification  is  essential  for  correlation- 
al or  followup  purposes,  the  straight-forward  approach  is  preferable  to  a 
number  coding  system.  For  sensitive  issues  or  where  there  is  expected 
distortion,  it  may  be  advisable  to  use  an  anonymous  questionnaire.  The 
other  articles  discussed  above  appear  to  support  Rosen's  recommendation. 

In  summary,  it  appears  that  anonymity  depends  not  only  on  unsigned 
questionnaires  but  also  on  the  conditions  under  which  the  questionnaires 
are  administered.  In  addition,  it  appears  anonymity  only  makes  a difference 
when  information  on  sensitive  areas  is  collected. 


Effects of Administration Time


Most of the studies conducted to evaluate the effects of administration
time were done using achievement and performance tests which are
beyond the scope of this study. The data were not pertinent to this review
as they were based on the number of right answers or total individual
scores. The related topic of questionnaire length is discussed in Chapter
VIII.


Miron  (1961)  in  a study  using  the  semantic  differential  did  vary 
directions in terms of how much time the subject was to take to respond to
an item. One group was instructed to mark all items at a fairly rapid
pace  and  to  attempt  to  recall  and  duplicate  their  markings  on  the  immed- 
iate retest.  A second  group  was  instructed  to  proceed  at  a slow  rate 
throughout  the  testing  and  to  recall  and  duplicate  markings  on  the  retest. 

A third  group  was  instructed  not  to  try  to  recall  original  testing  judg- 
ments and  to  proceed  rapidly  on  the  retest.  The  fourth  group  was  instructed 
not to recall but to proceed slowly. Test-retest correlations were computed
for  each  of  the  groups  and  were  all  uniformly  high.  The  standard  errors 
of  substitutions  were  found  to  range  between  .24  and  .32  for  groups  one  and 
three  respectively,  with  an  average  absolute  deviation  range  of  .10  to  .14. 
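
As an illustration of the kinds of summary statistics reported for this study, the following minimal sketch computes a test-retest correlation and an average absolute deviation for a single subject's ratings. The seven-point ratings, the variable names, and the two helper functions are hypothetical and are not taken from Miron's data; the standard error of substitution reported in the study is not reproduced here.

```python
from math import sqrt


def pearson_r(first, second):
    """Pearson correlation between two equal-length lists of ratings."""
    n = len(first)
    mean_first = sum(first) / n
    mean_second = sum(second) / n
    # Sum of cross-products and sums of squares around the means.
    sxy = sum((a - mean_first) * (b - mean_second) for a, b in zip(first, second))
    sxx = sum((a - mean_first) ** 2 for a in first)
    syy = sum((b - mean_second) ** 2 for b in second)
    return sxy / sqrt(sxx * syy)


def mean_abs_deviation(first, second):
    """Average absolute difference between paired test and retest ratings."""
    return sum(abs(a - b) for a, b in zip(first, second)) / len(first)


# Hypothetical ratings on seven semantic differential scales (1 to 7).
test_ratings = [1, 3, 5, 7, 2, 6, 4]
retest_ratings = [1, 3, 5, 6, 2, 6, 5]

r = pearson_r(test_ratings, retest_ratings)
mad = mean_abs_deviation(test_ratings, retest_ratings)
print(f"test-retest r = {r:.3f}, average absolute deviation = {mad:.3f}")
```

A high correlation together with a small average absolute deviation would indicate that the retest markings closely duplicated the original ones, which is the pattern described for all four instruction groups.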

It  appears  from  the  lack  of  studies  in  this  section  that  further 
research is needed on the effects of administration time on subjects'
motivation and on the effects of setting time limits for completing questionnaires.


Effects  of  Characteristics  of  Questionnaire  Administrators 


This  section  reviews  the  effects  that  certain  characteristics  of 
questionnaire  administrators  have  on  the  responses  received  from  the  people 
completing the questionnaire. Some of the studies dealing with interviewers
appear to be generalizable to questionnaire administrators.

Sex  of  the  interviewer.  Colombotos,  Elinson,  and  Loewenstein  (1969) 
studied  the  effect  of  the  interviewer's  sex  on  interview  responses.  They 
found  essentially  no  difference  in  the  reporting  of  psychiatric  symptoms 
to  male  and  female  interviewers  in  a community  survey.  However,  they  did 
speculate  that  differences  in  response  patterns  according  to  the  interview- 
er's sex  may  depend  on  subject  matter  as  well  as  on  the  composition  of  the 
respondent  populations  and  other  characteristics  of  the  specific  survey 
situation.  They  recommended  that  the  rationales  commonly  presented  for 
having  either  male  or  female  interviewers  be  critically  reexamined.  Thumin 
(1962)  found  that  the  percent  of  people  admitting  to  insomnia  differed  sig- 
nificantly according  to  the  sex  of  the  interviewer.  Twenty-two  percent  of 
the  subjects  interviewed  by  male  interviewers  reported  having  insomnia, 
compared  to  13%  of  the  subjects  interviewed  by  females.  No  interaction 
effects between sex of interviewer and sex of subject were found. In a
study conducted by Boyd and Westfall (1965a), it was found that women had
better  ratings  as  interviewers  than  men. 

Race  of  the  administrator.  Most  of  the  studies  conducted  to  determine 
if the race of the investigator had an effect on responses involved questionnaires
concerned with race. In a study conducted by Summers and
Hammonds  (1966)  a Negro  attitude  scale  was  administered  by  two  investigators. 
In  a portion  of  the  groups  both  investigators  were  white.  In  the  remainder 
of  the  groups  there  was  a Negro  and  a white  investigator.  The  results  in- 
dicated that  socially  acceptable  answers  to  the  Negro  attitude  scale  were 
reported  with  greater  frequency  when  one  of  the  investigators  was  Negro. 
However, the phenomenon was more pronounced among certain strata of respondents
than among others, suggesting that the effects should be viewed as
the result of interaction between investigator and respondent characteristics.
Similarly, Sedlacek and Brooks (1972) measured the attitudes of
whites toward blacks with the Situational Attitude Scale. Results indicated
that there were no measurable effects attributable to the race of the
person administering the Situational Attitude Scale.

Sattler (1970) reported on a comparative review of studies that considered
the effects of the experimenter's/interviewer's race on physiological
responses, task performance, intelligence testing, personality scores,
attitudes  and  preferences,  speech  patterns,  interviewing,  and  psychothera- 
peutic or  counseling  relationships.  One  finding  reported  was  that  respond- 
ents more  often  gave  socially  desirable  answers  to  interviewers  whose  race 
was  different  from  theirs,  particularly  if  their  social  status  was  lower 
than  that  of  the  interviewer  and  the  topic  of  the  question  was  threatening. 

Trent  (1954)  hoped  to  find  an  effect  of  the  investigator's  race  on 
responses  by  asking  black  and  white  kindergarten  children  to  select  their 
mother  from  three  photographs  of  models  (one  black,  one  brown,  and  one  white) 
using either a white or black investigator. He found:

1.  When  testing  white  children,  a white  investigator  produced 
more  responses  to  white  and  brown  mothers;  but  with  a black 
investigator, the choices were for a black or white mother.

2.  The  results  differed  little  with  black  youngsters  except  that 
with  the  black  investigator  they  chose  brown  or  black  mothers 
more  often. 

3.  The  black  children  avoided  making  any  decision  25%  of  the 
time  with  a white  investigator  but  not  at  all  with  a black 
investigator,  while  there  were  no  evasions  by  the  white 
children  in  either  condition. 

The  results  of  the  study  by  Schuman  and  Converse  (1971)  indicated 
that  the  race  of  the  interviewer  affected  only  responses  to  questions 
about  militancy  and  hostility  toward  whites,  and  not  responses  to  non- 
racial  questions  or  questions  about  discrimination.  Black  interviewers 
obtained  more  militant  answers,  particularly  from  lower  socioeconomic 
status black respondents. Reports to white interviewers indicated
a higher  level  of  militancy  for  upper  than  lower  income  blacks,  while 
black  interviewers  received  a fairly  even  distribution.  Schuman  and 
Converse  attributed  this  difference  to  a "white  effect"  rather  than  to  a 
"black  effect."  One  other  question  on  favorite  entertainers  showed  dif- 
ferences by  race  of  interviewer,  indicating  that  the  interviewers'  race 
can  establish  different  frames  of  reference  even  in  nonsensitive  areas. 

In a study by Babatz (1967) 120 Negro undergraduates were administered
the Test Anxiety Questionnaire under eight experimental conditions.
It was found that Negro subjects tested by a Negro examiner reported less
anxiety than those tested by a white examiner.

Other  characteristics  of  administrators.  Ehrlich  and  Riesman  (1961) 
investigated  the  tendency  of  teen-aged  female  students  to  give  socially 
desirable  answers  to  authority  figures.  A socially  desirable  answer  was 
defined as showing stronger ties to parents or other adults than to peers.
Less  socially  desirable  answers  were  more  often  given  to  younger  inter- 
viewers and  to  more  flexible  and  less  authoritative  interviewers  (as 
judged  by  personality  scores).  However,  for  interviewers  over  53,  person- 
ality did  not  make  a difference  as  people  over  53  were  seen  as  authority 
figures  regardless  of  their  personality  characteristics.  Respondents 
under  16  did  not  show  as  clear  a differential  by  age  of  interviewer  as 
did  those  between  16  and  18.  Sattler  (1970)  reported  that  the  greater  the 
disparity  between  the  status  of  the  interviewer  and  that  of  the  respondent, 
the greater the tendency for biased responses. And Siegman, Pope, and Blass
(1969)  found  that  more  productive  responses  were  elicited  by  high  than  low 
status  interviewers. 

Atkin  and  Chaffee  (1972)  tested  the  ingratiating  effect  that  an  inter- 
viewer may  have  over  a subject.  In  one  study  residents  were  asked  about 
their  opinions  about  firefighters.  Half  of  the  subjects  were  told  their 
interviewers  were  firemen  while  the  other  half  believed  they  were  only 
students.  In  another  related  study  mothers  were  asked  their  opinions  of 
violence  on  TV.  Half  were  told  their  interviewers  were  on  a Federal  com- 
mittee investigating  TV  violence,  while  the  other  half  were  told  their 
interviewers  were  students.  The  results  showed  significant  differences 
between  the  two  groups  in  each  study,  which  suggests  a subject  will  try 
to  answer  favorably  in  the  eyes  of  the  interviewer,  if  the  subject  can  de- 
termine some  means  of  response  bias. 

Quinn (1967), using military officers, examined the hypothesis that
performance raters would tend to give higher ratings to subordinates who
were most like themselves. Results indicated that there was no evidence that
performance ratings were influenced by similar characteristics of rater and ratee.
Johnson  (1958)  had  company  interviewers  rate  job  applicants,  and  the  appli- 
cants rate  the  interviewer.  He  concluded  that  personnel  selection  is 
largely  a matter  of  harmony  of  personal  characteristics  between  the  inter- 
viewer and  the  interviewee. 

Some  studies  dealt  with  the  experience  of  the  interviewer  related  to 
the  responses  received.  Smith  and  Hyman  (1950-51)  found  that  interviewers 
with  more  than  a year  of  experience  made  fewer  errors  in  recording  data 
than  those  with  no  interviewing  experience.  Schyberger  (1967)  reported  that 
the  results  of  a study  showed  nonsignificant  differences  between  interview 
completion  rates  for  experienced  and  inexperienced  interviewers,  and  that 
the  training  and  experience  of  the  interviewer  had  no  effect  on  the  number 
of  deviations  they  made  from  the  instructions.  Boyd  and  Westfall  (1965a) 
reported  that  all  interviewers  improved  with  experience,  and  training  im- 
proved interviewers  with  a high  school  education  but  had  little  effect  up- 
on interviewers  with  a college  education. 

More  research  needs  to  be  done  on  the  characteristics  of  the  admini- 
strators of  questionnaires.  No  studies  were  uncovered  related  specifically 
to  military  personnel.  For  example,  the  military  rank  of  the  person  ad- 
ministering a questionnaire  may  have  an  effect,  as  might  whether  the  ad- 
ministrator is  in  the  military  or  not. 


Effects  of  Administration  Conditions 


The effects of questionnaire administration conditions were studied
by  a number  of  authors.  In  a study  conducted  by  Hinrichs  and  Gatewood 
(1967),  male  technical  employees  in  a large  national  organization  rated 
their  degree  of  satisfaction  or  dissatisfaction  with  various  aspects  of 
their  work.  It  was  found  that  conditions  under  which  the  survey  was  ad- 
ministered did  have  an  effect  on  response.  When  employees  were  surveyed 
on  their  job  location  under  the  supervision  of  a company  representative, 
there was a tendency to respond more favorably to a significant number of
general  opinion  questions,  particularly  questions  dealing  with  the  company 
in  general,  than  when  they  were  permitted  to  respond  to  a questionnaire 
mailed  to  their  home. 

Green  (1951)  determined  that  a larger  percentage  of  people  attempted 
to  fake  on  the  Kuder  Preference  Record,  the  Guilford  Inventory  of  Factors, 
and  the  Guilford-Martin  Inventory  of  Factors  when  these  tests  were  used 
for selection purposes than when they were administered to a control group.
Rainio  (1956)  also  studied  the  effect  of  the  selection  situation  on  re- 
sponses to  questionnaires.  His  experiment  showed  a significant  difference 
between  research  and  selection  situations  for  various  traits.  In  the 
selection  situation  there  was  a trend  to  higher  scores  on  those  variables 
shown  to  have  higher  correlations  with  the  criterion.  Heron  (1956)  design- 
ed an  experiment  to  measure  the  effect  of  differences  in  test  conditions. 
Applicants for the job of omnibus conductor were given a two-part personality
test covering emotional maladjustment and sociability. In one case
it  was  administered  as  part  of  the  application  process  along  with  a health 
examination.  In  another  situation  the  test  was  administered  to  individuals 
after  they  had  been  hired.  There  was  a statistically  significant  differ- 
ence between  the  mean  scores  and  variances.  The  group  administered  the 
questionnaire as part of the application process had a higher mean score
and  greater  variance. 

Several  of  the  studies  on  the  effects  of  questionnaire  administration 
conditions  were  concerned  with  raters  and  their  performance.  Bayroff, 
Haggerty,  and  Rundquist  (1954)  studied  the  validity  of  rating  related  to 
rating  techniques.  Officer  students  served  as  a rater-ratee  population 
using  two  types  of  graphic  rating  scales  and  two  modifications  of  the 
forced  choice  technique.  Results  indicated  that  ratings  earlier  in  a 
series  were  more  valid  than  those  at  the  end. 

Freeberg  (1969)  studied  the  relevance  of  rater-ratee  acquaintance 
in  the  validity  and  reliability  of  ratings.  Unacquainted  subjects  worked 
in  three-man  groups  under  relevant  and  irrelevant  acquaintance  conditions. 
The  subjects  rated  one  another  on  scales  that  defined  several  cognitive 
skills.  They  were  also  rated  on  these  same  scales  by  observers  who  were 
dependent  on  visual  information  only  and  were  unacquainted  with  the  group 
members  or  the  nature  of  the  task  being  performed.  Group  members  under 
the  relevant  acquaintance  condition  achieved  consistently  good  validity 
ratings  for  all  three  cognitive  areas,  with  the  best  validity  rating  on 
mathematical  ability.  Validity  under  the  irrelevant  acquaintance  condition 
was  nil  on  all  scales.  Observers  achieved  significant  validity  (although 
at  lower  levels  than  participating  group  members)  only  for  ratings  under 
the  relevant  acquaintance  condition. 


Shen  (1925)  found  that  when  28  subjects  ranked  each  other  on  friend- 
ship and  eight  other  traits  (intellectual  quickness,  intellectual  profound- 
ness, memory,  impulsiveness,  adaptability,  persistence,  leadership,  and 
scholarship)  there  was  a tendency  to  overestimate  friends  on  all  traits 
except impulsiveness and to underestimate those rated as less intimate.

Mayo (1956) concluded that for peer rating there is a substantial halo
effect.  This  conclusion  was  based  on  a study  of  peer  ratings  of  intelli- 
gence and  effort  with  objective  measures  of  both. 

It  is  apparent  that  additional  research  is  needed  on  the  effect  of 
administration  conditions.  Such  research  should  include  the  study  of 
fatigue  factors. 


Effects  of  Other  Factors  Related  to  Questionnaire  Administration 

Many  things  affect  the  data  collected  by  questionnaires,  raters,  inter- 
views, and  observers.  This  section  discusses  those  uncovered  by  the  review 
of the literature: investigator bias, question bias, observer bias, halo
effects, and the biasing effect of interviews.

Investigator bias. One of the main sources of bias is the
researcher himself. Kornhauser (1947) discussed this problem of bias in
research. He identified several biases: choice of subject matter; study
design and procedure; unfair or loaded phrasing of questions; and
interpretation and reporting of results. He felt the sources of such biases
are the researcher's relationship with the client, the researcher's personal
involvement in a particular theoretical position or research technique, and
those personal traits attributable to class, race, and political ideology.
To reduce the impact of bias he felt that researchers need to be aware of
such problems, seek critiques from independent sources, pursue public
scrutiny through publication of reports, and continue to pursue technical
improvement in opinion research.

Many things of which the researcher is not aware influence the
results. Jensen and Schmitt (1970) designed a study to determine the extent to
which responses to test items of the type frequently found in personality
inventories would be influenced by the title associated with the test.
An instrument was constructed and administered to eight treatment groups.
Each administration differed primarily in the title the test bore. The
dependent variables were measures of the tendency to lie, respond
defensively, answer carefully, and complete questions. Subjects tended to lie
and respond more defensively to titled tests than to a test having no
title and administered under nonthreatening conditions. All other
comparisons were not statistically significant.

Dillehay  and  Jernigan  (1970)  tested  the  hypothesis  that  biased  ques- 
tionnaires are  effective  in  inducing  changes  in  the  subsequent  opinions  of 
respondents.  Systematically  biased  and  control  questionnaires  were  con- 
structed in  a manner  designed  to  elicit  either  harsh,  lenient,  or  neutral 
opinions  of  respondents  concerning  the  treatment  of  criminals.  After 
answering one form of these treatment questionnaires, respondents
registered their opinions on standardized attitude scales. The results
indicated  that  the  treatment  questionnaires  were  successful  in  manipulating 
responses  to  lenient  bias.  Subjects  displayed  more  lenient  attitudes  after 
exposure  to  the  lenient  form  than  after  exposure  to  either  the  neutral  or 
harsh  forms  of  the  questionnaire. 

Question  bias.  Suchman  and  Guttman  (1947)  gave  four  suggestions  for 
eliminating question "bias": asking many questions on the same topic;
determining  by  scale  analysis  whether  questions  ask  the  respondents  about 
the  same  dimensions  of  opinion;  asking  "How  strongly  do  you  feel  about 
this?"  after  each  opinion  question;  and  relating  the  content  of  opinion 
to  intensity  of  feeling. 

Observer  bias.  O'Leary  (1973)  and  Skindrud  (1972)  studied  observer 
bias  in  field  studies.  O'Leary  found  that  simply  informing  observers  of 
experimental  hypotheses  did  not  produce  observational  data  consonant  with 
those  hypotheses.  However,  questionnaire  responses  following  an  experiment 
with  different  induced  expectations  did  produce  global  data  consonant  with 
the experimental hypotheses. He also found that, if observers are informed of
the  experimental  hypotheses  and  the  investigator  provides  daily  feedback 
to  them  indicating  how  well  their  data  support  his  hypotheses,  observers 
will  report  data  consonant  with  those  hypotheses.  Skindrud  (1972)  led 
three  groups  of  observers  to  expect  different  outcomes  from  their  observa- 
tions. Even  though  the  groups  expected  different  outcomes,  they  were 
totally  unbiased  in  their  reports  of  deviant  behavior  in  group  comparisons. 
Failure  to  obtain  evidence  for  observer  bias  in  spite  of  the  demonstrated 
manipulation  of  observer  expectations  was  attributed  to  the  precautions 
taken  to  assure  high  levels  of  observer  accuracy. 

Halo  effects.  Several  studies  discussed  the  halo  effect,  which  is 
the  tendency  for  trait  ratings  to  reflect  in  part  the  rater's  general  im- 
pression of  the  person  he  is  rating.  Bingham  (1939)  reviewed  the  results 
from  two  examining  boards  responsible  for  rating  29  candidates  for 
executive  director  positions  in  two  Pennsylvania  counties.  He  found  the 
correlations between ratings for the general category "Personal Fitness"
and  the  ratings  for  specific  traits  such  as  voice,  poise,  freedom  from 
bias,  and  ability  to  plan  and  organize  to  be  positive  and  rather  high. 
Johnson  and  Vidulich  (1956)  found  that  halo  effect  is  a judgmental  error 
rather  than  the  effect  of  an  objective  correlation  of  traits.  In  their 
study  one  group  rated  five  individuals,  one  individual  per  day  on  five 
traits,  while  another  group  rated  five  individuals  on  one  trait  per  day. 
Johnson (1963) reanalyzed the data and found that the usual interaction
between raters and individuals was significant under both
experimental conditions. Hence he concluded that the evidence for halo
effect  due  to  judging  operations  remains  questionable.  Bucklow  (1960) 
concluded that, if items are constructed so "as to relate to clearly
observable aspects of behavior which do not overlap," rating will be improved,
although  "halo"  cannot  be  eliminated. 
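
The kind of analysis Bingham (1939) reported, correlating a general rating
with ratings on specific traits, can be illustrated with a minimal sketch.
The data, trait names, and function names below are hypothetical and are
not taken from any of the studies cited. A high correlation is only a
warning sign of halo; as Johnson and Vidulich (1956) noted, it must be
distinguished from a true correlation among the traits themselves.

```python
from math import sqrt
from statistics import mean


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length lists."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def halo_correlations(global_ratings, trait_ratings):
    """Correlate each specific-trait rating with the general rating."""
    return {trait: pearson(global_ratings, scores)
            for trait, scores in trait_ratings.items()}


# Hypothetical ratings of five candidates by one board (1 = low, 5 = high).
overall = [5, 3, 4, 2, 1]
traits = {
    "voice": [5, 3, 4, 2, 1],
    "poise": [4, 3, 5, 2, 1],
}
halo = halo_correlations(overall, traits)
```

Uniformly high values across unrelated traits would suggest that the
general impression is coloring the specific ratings.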

In a study conducted by Gordon (1972), the results indicated that
neither the differential accuracy phenomenon (the situation where correct
behavior is identified more accurately than incorrect behavior) nor the
overall accuracy of ratings was related to the favorability of the
rater's general impression of the ratee. He concluded that these findings
make suspect the current practice of operationalizing leniency error by
use of the average level of favorability of global rating. Bayroff,
Haggerty, and Rundquist (1954) found that the average of a number of
ratings was more valid than a single rating per ratee. Rappard (1950)
also reported that mutual arrangement between a number of raters is felt to
enhance greatly the correctness of the rating.

Zavalloni  and  Cook  (1965)  concluded  that  ratings  of  unfavorable  as  well  as 
neutral  items  are  influenced  by  raters'  attitudes.  Extreme  judges  make 
fine  discriminations  at  their  own  end  of  the  scale  and  lump  together  the 
items  at  the  other  end.  Falk  and  Bayroff  (1954)  concluded  that  the  rater 
is  the  principal  source  of  contamination  in  studies  using  ratings. 


Biasing effect of interviews. Many studies have been conducted to
show the biasing effect of the interview. Since unstructured interviewing
is not within the focus of this review, only a few of these studies are
discussed below to indicate the scope of the bias and some of the
recommendations for controlling it.


In  a study  by  Stanton  and  Baker  (1942),  five  professionally  trained 
interviewers  obtained  significantly  more  correct  recognitions  of  previous- 
ly exposed  geometric  figures  when  they  knew  the  identity  of  the  correct 
figures  than  when  they  did  not.  In  contrast,  in  a study  by  Lindzey  (1951) 
graduate  students  with  training  in  interviewing  methods  failed  to  obtain 
significantly  more  correct  recognition  of  previously  exposed  geometric 
figures  when  they  knew  the  identities  of  the  correct  figures  than  when 
they  did  not. 

Hanson and Marks (1958) reported that the factors leading to
significant effects of the interviewer upon results are: relatively high
ambiguity in the concept or wording of the inquiry; interviewer "resistance"
to a given question; and additional questioning or probing. Ferber and
Wales  (1952)  reported  that  interviewer  bias  could  exist  without  being 
apparent  in  an  analysis  of  overall  sample  distributions.  The  direction 
of  bias  did  not  appear  to  be  uniform.  Cahalan,  Tamulonis,  and  Verner 
(1947)  concluded  that  the  least  interviewer  bias  was  found  in  questions 
that could be answered "Yes" or "No." Shapiro and Eberhart (1947)
reported that interviewer bias on attitude questions resulted from differences
in  the  interviewing  method  used,  differences  in  the  degree  of  success  in 
eliciting  factual  information,  and  differences  in  classifying  the  respond- 
ent's answers. 


Back, Hill, and Stycos (1955), by analyzing the data reported from
interviews in a fertility program in Puerto Rico, found reproducibility
differences which were attributed to the interviewer and not to a response
set of the respondents to four Guttman scales. Two "traits" were found
among the interviewers which were negatively correlated: conscientiously
completing the questionnaire, and understanding the study. The result
is either a quality interviewer or a quantity interviewer; the choice
between them should be decided by the type of data needed. Smith and Hyman
(1950-51) concluded that interviewer expectations had a more powerful effect
on the results (recording errors) than did the interviewer's ideological
preferences.




Chapter  X 


CHARACTERISTICS  OF  RESPONDENTS  THAT  INFLUENCE 
QUESTIONNAIRE  RESULTS 


This chapter discusses various types of response bias. Response
bias refers to the tendency of subjects to respond to questions in a
pattern or set regardless of the content of the question. One hundred
thirty-seven studies were searched, indicating that the subject is
recognized as an important aspect in questionnaire technology. These
studies are discussed in terms of: item format biases; social
desirability response set; acquiescence response set; extreme response set;
the effects of attitudes on responses; and the effects of demographic
characteristics on responses.

Cronbach (1950) and Horn and Cattell (1965) examined the disturbing
effect of response bias on test reliability and validity. Fricke (1957)
asserted that response bias could explain repeated findings that
well-adjusted, successful persons obtain more abnormal scores on the subtle
scales of the MMPI than maladjusted, unsuccessful persons. Rorer (1965),
however, concluded that if a bias to consistently respond in a particular
way exists, it would be eliminated by rewording the questions in the
opposite direction. His results indicated that response bias could be
attributed simply to the content of the question and therefore, as an
intervening variable, would have only minor influence.

Most of the research on response bias does substantiate its existence.
Nunnally and Husek (1958) demonstrated response bias by substituting randomly
chosen foreign words for meaningful components of test items and then
measuring the predisposition of subjects to give particular answers to
these ambiguous questions. In a similar study McCord (1951) designed
questions that could not be answered factually or truthfully by saying
"yes," yet he found between 8% and 53% affirmative responses. Berg and
Rapaport (1959) eliminated the questions altogether and had their subjects
answer imagined questions; they found a great tendency among their
respondents to choose culturally valued expressions such as "yes," "true,"
and "agree." Other researchers such as Webster (1960) have found high
correlations between response patterns on personality inventories and
personality measures like social alienation and schizoid functioning.

Sudman  and  Bradburn  (1974)  have  examined  many  possible  sources  for  re- 
sponse bias  and  the  effect  this  variable  has  on  error  in  research.  The 
remainder  of  this  chapter  will  describe  the  studies  done  during  the  last 
twenty-five  years  in  identifying  the  possible  sources  of  response  bias. 


Item Format Biases


It  has  been  shown  that  response  bias  is  related  to  the  format  of  the 
question  and  the  methods  of  response  available.  Cronbach  (1946)  and 
Miklich (1966) have demonstrated how item ambiguity produces a recognizable
pattern of response. Tajfel (1959) found that even abstract stimuli
influence  the  physical  attributes  of  items  to  be  judged  when  no  other 
information is available. Zajonc and Nieuwenhuyse (1964), however, found
that  the  frequency  of  common  words  as  answers  played  only  a negligible 
role  in  response  bias.  Sax  and  Carr  (1962),  working  with  omnibus  and 
subdivided  test  formats,  and  O'Dell  (1962),  studying  the  sequence  of  items, 
concluded  that  these  format  variables  were  significantly  correlated  with 
response  bias. 

Aschal  (1958)  concluded  that  response  bias  can  be  expected  from  all 
close-ended  questionnaires  where  answers  must  be  selected  from  two  or 
more  fixed  choices.  Jackson  and  Minton  (1963)  found  that  the  forced 
choice  format  of  selecting  one  of  a pair  of  choices  could  eliminate  a 
massive  response  set  where  some  respondents  tend  to  check  many  items  from 
a list and others only a few. Considering true-false, multiple choice,
and card sorting methods, Van Der Veen, Howard, and Austria (1970) concluded
that  all  three  formats  were  relatively  free  of  response  bias;  but,  like 
Cataldo,  Johnson,  and  Kellstedt  (1970),  they  found  card  sorting  to  show 
the  least  effects  of  format  response  bias.  One  additional  response  bias, 
that  of  random  markings,  was  noted  by  Flanagan  (1955),  and  seemed  to  be 
related  to  the  motivation  of  the  subjects  when  they  had  no  reason  to  take 
the  test. 


Social  Desirability  Response  Set 

Evidence  has  been  found  that  a response  set  or  style  exists  according 
to  how  favorably  society  would  view  the  response.  This  type  of  social 
desirability  response  was  found  by  Rugg  and  Cantril  (1942)  to  be  so  power- 
ful that  subjects  would  not  tend  to  deviate  from  social  norms  in  their 
answers  even  though  their  behavior  denied  the  opinion.  Warren  (1972) 
successfully  trained  some  subjects  to  a particular  response  set  but  found 
that  highly  socially  desirable  items  prevented  facilitation.  In  an 
attempt  to  further  define  this  factor  of  response  bias,  Fehrer  and  Strupp 
(1949)  determined  that  prestige  value  had  no  effect  on  responses  to  job 
title preferences, and Krug and Northrup (1959) noted that on self-description
inventories response time decreased as social acceptability increased.

The  influence  of  social  desirability  was  noted  by  French  (1958)  in 
scaling  instructions  that  included  the  phrase  "the  Air  Force  way."  When 
a respondent's job (Green, 1951) or his incarceration (Dubeck et al., 1971)
depended  upon  his  answers,  there  was  a great  tendency  toward  socially 
desirable  responses.  Wiseman  (1972)  found  that  anonymous  questionnaires 
as  opposed  to  personal  interviews  were  necessary  in  order  to  surmount  the 
social  desirability  response  set  on  such  socially  sensitive  issues  as 
abortion  or  contraceptives.  Heilbrun  (1958)  noted  that  under  defensive 
conditions,  subjects  avoided  unfavorable  self-descriptive  adjectives  but 
did not necessarily increase selection of favorable adjectives.

Several authors (Edwards & Diers, 1963; Dixon, 1970; Potter &
Tinkleman, 1970; Eysenck & Eysenck, 1963; Brod, Kernoff & Terwillinger,
1964) have identified subjects with a high social desirability response
rate. They found these respondents to give more true responses to neutral
items, to be more susceptible to manipulation by social pressure, to be more
likely introverts, and to score higher on a "lie" scale. Buss (1959)
found that this response set was elevated with some subjects when given
response choices styled like "trouble controlling," "must admit," and
"tempted."

Faking or responding with socially desirable answers which are not
true is a response error. Izard and Rosenberg (1958) gave instructions
to their treatment group to try to fake their answers but found no
significant differences between that group's answers and the control group's
in a forced choice test. Several other authors (Leftwich & Remmers, 1962;
Eisenberg, 1965; Bartlett & Doorley, 1967), however, obtained significant
results showing fakability present in forced choice tests under varying
instructional sets. Jones (1959) tried to neutralize faking by instructing
subjects to do so and then establishing correlations of reliability with
other tests. However, he was unable to achieve high correlations. Cliff
(1968a) determined that faking responses as well as candid ones were simple
functions of meaning space due to the great unanimity among the subjects
concerning how to fake. Edwards (1957a) noted that even anonymity failed to
eliminate the social desirability response set.

The forced choice instrument format has been studied for its
susceptibility to social desirability response style. Silverman (1957) and
Karr (1959a) found the forced choice method to minimize the effect of
social desirability, while Krug (1958), Howe (1960), and Bernhardson and
Fisher (1971) found that the factor needed control in forced choice tests.
Isard (1956) concluded that in forced choice formats, ambiguous items
tended to be freer of social desirability response set than positively or
negatively worded items. Due to the freer response choice of card sorting,
Edwards and Horst (1953) and Edwards (1955) examined the method for social
desirability effect but found it, too, needed controls to eliminate the
bias. Hillmer (1958) found this response set to operate whenever the
subject had the opportunity to respond in terms of it.

Krieger  (1964)  and  Smith  (1967)  have  both  developed  procedures  for 
controlling  or  balancing  social  desirability  by  using  loaded  items  in  the 
test  and  then  adjusting  the  subject's  score. 


Acquiescence  Response  Set 


The response set to consistently agree, to say "yes," or to say "true,"
is called acquiescence. Upon comparing subjects taking attitude measuring
tests, Lorge (1937) found correlations among those who marked "yes," "like,"
and "1" or "2" as well as correlations for those marking "no," "dislike,"
and "7" or "8." Shipley, Norris and Roberts (1946) noted that judgement
time was decreased when subjects were to choose the most pleasant color
from many pleasant colors, or the most unpleasant one from many unpleasant
ones. They concluded that this was an indication of an acquiescent response
set. Other authors (Jackson & Messick, 1957; Mahler, 1962; Eysenck &
Eysenck, 1963a; Foster, 1961; Quinn, 1963) have identified the acquiescence
response set as a behavioral attitude to agree and accept even if subjects
must alter their original opinions to do so. Elliot (1961) determined
that acquiescence was highly dependent upon test construction and the
respondent's aptitude, while Hanley (1965) found that acquiescence occurred
with difficult rather than easy inventory material.

Gage and Chatterjee (1960) and Diers (1961) have further concluded
that there is an opposing response set -- the naysayer -- which they found
more valid than yeasayers. Wells (1963) identified both yeasayers and
naysayers, but found more distortion in survey findings due to the former.




A still  unsettled  argument  is  whether  or  not  acquiescence  is  simply 
a personality trait. Couch and Keniston (1960), Frederiksen and Messick
(1958), Adams (1956), and Becker and Myers (1970) all pointed to the
correlations of personality factors with acquiescence. In contrast, Foster
and Grigg (1963), Eysenck (1962), and Findikyan (1969) found acquiescence
unrelated  to  personality  factors  in  personality  measures,  but  conceded 
there  may  be  a relationship  in  sociopolitical  opinions  or  attitudes.  To 
confound  the  matter,  Cohn  (1956)  contended  that  the  F scale  is  contaminated 
by  an  acquiescence  response  set,  while  Small  and  Campbell  (1960)  asserted 
that  the  relationship  between  conformity  and  the  F scale  is  a function  of 
content  and  not  acquiescence. 

Controls for acquiescence have been researched and some information
is available on the response set's effect. Wells (1961) has detailed
several design and statistical analysis procedures for eliminating the
effect. Clancy and Garsen (1970) found that absolute scales of appeal
were distorted by yea- and naysaying effects. Banta (1961) and Cloud and
Vaughn (1970) concluded that item ambiguity increases an acquiescent
tendency, but that, when it is minimized, balanced keying of items prevents
contamination. Campbell, Siegman & Rees (1967) found that positive-negative
reversal of items did not entirely eliminate the problem, but Findikyan
(1969) concluded that reversal is an effective control if the items are not
awkwardly worded. Falthzik and Jolson (1974) determined that a higher
intensity of agreement is reached when items are positively stated than
when they are negatively stated. Knowles (1963), while finding the balancing
of scales of dubious value to counteract acquiescence, did demonstrate that
true-false questionnaires can be differentially prone to acquiescent
response set.
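
Balanced keying and item reversal, as discussed above, can be illustrated
with a minimal sketch. It assumes a 1-to-5 agreement scale and an equal
number of positively and negatively keyed items; the function names and the
simple acquiescence index (mean raw agreement minus the scale midpoint) are
illustrative assumptions, not procedures taken from the studies cited.

```python
def reverse_score(response, low=1, high=5):
    """Reflect a response about the scale midpoint (5 -> 1, 4 -> 2, ...)."""
    return low + high - response


def score_balanced_scale(responses, keys, low=1, high=5):
    """Return (content_score, acquiescence_index) for one respondent.

    responses: raw answers, one per item, on a low..high agreement scale.
    keys: +1 for positively keyed items, -1 for reversed (negatively keyed).
    The acquiescence index assumes as many +1 keys as -1 keys.
    """
    if len(responses) != len(keys):
        raise ValueError("responses and keys must be the same length")
    # Content score: reversed items are reflected before averaging.
    keyed = [r if k > 0 else reverse_score(r, low, high)
             for r, k in zip(responses, keys)]
    content = sum(keyed) / len(keyed)
    # Acquiescence index: raw agreement, ignoring the direction of keying.
    midpoint = (low + high) / 2
    acquiescence = sum(responses) / len(responses) - midpoint
    return content, acquiescence


keys = [+1, -1, +1, -1]
yeasayer = score_balanced_scale([5, 5, 5, 5], keys)    # agrees with everything
consistent = score_balanced_scale([5, 1, 4, 2], keys)  # answers by content
```

On a balanced scale the yeasayer's agreement cancels out of the content
score and appears instead in the acquiescence index, while the respondent
who answers by content shows an index of zero.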

There is a concern that social desirability and acquiescence are
related in such a way that an individual with a tendency toward conformity
will consistently reflect both biases. Several authors (Schultz, 1962;
Stricker, 1962, 1963; Gloye, 1964; Liberty, 1965) have studied the
relationship of the two effects and found no correlation. In two additional
investigations, the two variables were studied but only one of them could be
established to exist independently: Siller and Chipman (1963) found an
acquiescent response set factor, and Winters and Bartlett (1966) found only
an independent social desirability response set.

Extreme  Response  Set 

Several  studies  have  examined  the  possibility  that  an  extreme  response 
set  exists  where  some  individuals  tend  to  consistently  select  exaggerated 
choices for positions. Rundquist (1950) found a low but significant
correlation for preferred personality and interest items. Rather than
use  it  as  a predictive  instrument,  the  author  suggested  attempts  be  made 
to  eliminate  the  effect.  Goldsamt  (1972)  found  evidence  that  an  extreme 
response  style  did  exist  on  neutral  content  items,  but  that  the  effect  was 
not  generalizable  due  to  its  interaction  with  content-extreme-response- 
style.  Mascaro  (1968)  examined  the  width  of  the  response  categories  and 
extreme  response,  but  found  no  significant  correlation. 
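
One simple way to quantify an extreme response style is the proportion of
a respondent's answers that fall on the end points of the scale. The sketch
below is illustrative only; the 7-point scale, the function name, and the
index itself are assumptions and do not reproduce the measures used in the
studies cited above.

```python
def extreme_response_index(responses, low=1, high=7):
    """Proportion of answers at either end point of a low..high scale."""
    if not responses:
        raise ValueError("at least one response is required")
    extremes = sum(1 for r in responses if r in (low, high))
    return extremes / len(responses)


# Hypothetical answers of two respondents to eight 7-point items.
extreme_responder = extreme_response_index([1, 7, 4, 7, 2, 1, 4, 7])
moderate_responder = extreme_response_index([3, 4, 5, 4, 3, 5, 4, 4])
```

Such an index is most informative when computed over items of neutral or
mixed content, since strongly worded items may draw extreme answers for
reasons of content rather than style.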

An  unnecessary  assumption  is  often  made  that  a "don't  know"  or 
middle  category  response  is  lacking  in  demonstrative  extremeness.  Worthy 
(1969)  and  Ziller  and  Long  (1965)  presented  evidence  that  this  response  is 
valid  and  can  be  related  to  dogmatism  as  a status-defense  mechanism.  Two 
studies  (Adams,  1956;  Cooper  & Cowen,  1962)  pointed  out  that  extreme  re- 
sponses are  not  necessarily  inhibited,  and  that  a lack  of  inhibition  would 
not  explain  the  bias  pattern. 

Soueif (1958) found a positive correlation between extreme response
style and intolerance to ambiguity. Levy-Leboyer (1955b), however, found
that subjects who consistently omitted items were affected more by
motivation than by a persisting psychological trait. Lucky and Grigg (1964)
examined defensiveness and deviant responses, but concluded that outside
self-description the two variables were unrelated.


Effects  of  Attitudes  on  Responses 

A response  bias  attributed  to  an  attitude  is  one  which  is  influenced 
by  the  respondent's  opinion,  belief,  or  position.  Shen  (1925)  recognized 
the  disturbing  influence  that  acquaintance  had  on  raters  and  ratees. 
Hinckley (1932a), Prothro (1955), and Ferguson (1935a) found that by using
Thurstone's  methods  of  equal  interval  scaling,  judges  could  rate  items 
without  being  influenced  by  their  own  attitudes.  Bruvold  (1971),  however, 
confirmed  a competing  hypothesis,  that  anti-attitude  judges  would  rate  un- 
favorable items  higher  and  favorable  items  lower  than  pro-attitude  judges. 
Other  similarly  disturbing  attitude  effects  in  response  were  found  by 
Prothro  (1957)  concerning  personal  involvement  of  judges,  and  by  Mogar 
(1960)  involving  high  authoritarians  in  controversial  social  issues. 

Explanations  for  these  contrasting  research  conclusions  came  generally 
as  controls  developed.  Kendall  (1954)  found  unstable  or  changing  responses 
were  contributed  to  by  shifts  in  the  mood  of  the  respondent,  relative 
values  among  the  possible  choices,  and  the  degree  of  interest  present  in 
the  question.  Kelley,  Hovland,  Schwartz,  and  Abelson  (1955)  found  that 
blacks and whites in a competitive situation would make similar judgements
concerning the social position of Negroes but, when separated, blacks
tended toward extreme responses. Zimbardo (1960) found no differences between
pro-  and  anti-judges  when  well-structured  sentences  were  used;  but,  as  they 
became  more  ambiguous,  the  responses  became  more  attitudinally  biased. 
Upshaw  (1962)  noted  that,  if  the  judge's  own  position  was  outside  the 
range  of  responses,  bias  would  be  evident.  Ramlo  (1968)  was  able  to  shift 
judges'  responses  through  attitudinal  bias  by  instructing  them  to  disregard 
their  own  opinions,  which  they  could  not  do. 




Other factors evidenced in attitudinal bias research are ego
involvement (Pauli, 1968), prestige value (Doncel, Alimena, and Birch, 1949),
lack of attitude or position (Georgoff, Hersker, and Murdick, 1972), and
task avoidance behavior (Weitman, 1964). In addition, Hill (1953) noted
that inconsistent judgements decreased as psychological distances between
items increased. French (1958) suggests that preference schedules be
rescaled when attitudinal norms of groups differ greatly from the standard.

Effects  of  Demographic  Characteristics  on  Responses 

The  final  general  area  contributing  to  response  bias  is  the  effect  of 
demographic  characteristics.  Such  identifications  as  sex,  age,  race,  or 
education  have  been  examined  to  determine  if  similarities  of  such  variables 
among respondents tend to be related to a response pattern. Roslow and
Blankenship (1939) pointed out the theoretical need for designing questions
with the respondent's background in mind. Johnson (1958) found that in
personnel selection, the harmony of demographic characteristics played a
major role in interviewer-interviewee relations. Schaie (1962) noted that
factor matching in analysis depended on demographic knowledge of raters
and responders. Jury (1971) determined that demographic characteristics
reflected differences in workers' views of organizational-type variables,
but that they were not related to individual-type variables.

Socioeconomic class has been identified by Soueif (1958) and by
Clancy and Garsen (1970) as an influence of bias in response patterns.
Sgan (1967) found middle class children to be more susceptible to
experimenters' influence than lower class children. Race was found to be an
identifying factor in extreme response rates by Sherif and Hovland (1953),
while Sattler (1970) noted that response bias increased in interviews as
racial disparity grew. Another characteristic bias, level of education,
has been shown to relate to a decrease in acquiescent response style
(Falthzik and Jolson, 1974), and to an increase in nonacceptance of causal
explanations in ambiguous situations (Nunnally & Husek, 1958).

Several  authors  have  identified  other  demographic  characteristic 
variables  such  as  age,  religion,  intelligence,  sex,  marital  status,  parent- 
hood, nationality,  urban  or  rural  residence,  income,  rank,  and  experience. 
Such variables have been correlated with biases found in response
consistency (Hart, Faust, Rowland & Lucier, 1964; Dakin & Tennant, 1968; Goldsamt,
1972;  Flyer  & Carp,  1962;  and  Sicinski,  1970).  Aschal  (1958)  and  Wells 
(1963)  found  correlations  between  acquiescence  and  demographic  variables. 
Quinn (1967), in a study contrasting previous research, found no
relationship between several demographic characteristics of raters and ratees and
their  ratings.  Bauer  (1947),  Ferber  (1966),  and  Ognibene  (1973)  found 
common  characteristics  of  youth  and  less  education  among  nonrespondents  in 
mail  surveys. 

Other  studies  have  explored  more  removed  characteristics  searching 
for  significant  differences  in  responses.  Bayroff,  Haggerty,  and  Rundquist 
(1954) examined "hard raters" and "easy raters" but found no differences
in validity. High and low "feeling" persons were found by Frisbie and
Sudman (1968) to make long answers on open-ended questionnaires. Ferber
(1956) suggested that survey researchers determine the state of knowledge
of their sample to avoid a response bias by persons ignorant of the issues
and by persons misinformed about the issues. Two studies of military
personnel (Hollis, 1954; Gilbert, 1956) indicated that the influence of
occupational  environment  may  be  related  to  a bias  against  criticism  and 
for  acquiescence. 


Summary  and  Conclusions 

Response  bias  is  an  error  factor  in  questionnaire  technology  due  to 
a pattern of answers made by the respondent that appears to be related to
extraneous  variables.  Several  areas  of  origin  for  response  bias  have  been 
studied  which  are  grouped  in  this  chapter  into  six  categories. 

1.  Format  biases  are  responses  influenced  by  the  question  stem  or  re- 
sponse alternatives.  Sequence  and  fixed  choice  responses  have 
been  related  to  this  bias. 

2.  Social  desirability  has  been  well  identified  as  a response  set 
where  persons  answer  according  to  the  norms  they  believe  society 
condones.  The  faking  of  responses  on  questionnaires  contaminates 
the  results,  and  controls  must  be  designed  to  prevent  its  operation. 

3.  Acquiescence  is  the  bias  demonstrated  by  yeasayers  who  tend  to  re- 
spond more  often  agreeably  than  disagreeably.  Some  dispute  remains 
over  this  bias  as  to  whether  it  is  actually  a personality  trait. 

4.  The extreme response set refers to the pattern of answers persons
make which tend to be unevenly distributed toward one or both poles.
As with acquiescence, some research indicated that this response
style may also be a personality description.

5.  Attitudes may influence responses in identifiable patterns. Opinions
and beliefs seem to be related to a response bias.

6.  Demographic characteristics have been shown to be related to response
bias. Education, age, social class, etc., have been found
influential in a response pattern especially noted by consistency.

Research  during  the  last  twenty-five  years  establishes  a very  strong 
case  for  the  existence  of  response  bias.  Studies  documenting  its  origins 
in  social  desirability,  questionnaire  format,  and  demographic  character- 
istics are  numerous.  More  evidence  is  needed  to  confirm  that  acquiescence, 
extreme  response  set,  and  attitudes  are  actually  biases  and  not  personality 
traits. None of the control measures examined thus far, including changing
wording direction, balancing scales, using card sorts, forced choice, or
open-ended designs, or loading questions, has convincingly eliminated response
bias. More detailed identification and control methods are areas
of needed further research in response bias.




Chapter  XI 


CONSIDERATIONS  RELATED  TO  THE  EVALUATION 
OF  QUESTIONNAIRE  RESULTS


Considerations related to the evaluation of questionnaire results
was another area not stressed during the literature review. Some articles
were  reviewed,  however,  that  pertain  to  the  scoring  of  questionnaire 
results,  the  properties  and  uses  of  ipsative  scores,  and  data  analyses. 


Scoring  of  Questionnaire  Results 

Practical considerations. Erdos noted (1948b) some points that are
often forgotten until too late: that both time and money can be saved by
planning the questionnaire in line with tabulation requirements. He used
sample  questions  to  illustrate  the  relationship  between  order  of  questions 
and  tabulation,  and  how  phrasing  of  questions,  sequence,  and  layout  can 
affect  tabulation  time.  He  also  pointed  out  that  whether  data  are  to  be 
tabulated  by  hand  or  by  machine  is  an  important  decision  and  should  be 
made  in  advance.  The  precoding  of  responses  whenever  possible  was  also 
recommended. 

Quite  early,  Bass  and  Wurster  (1956)  described  the  use  of  IBM  Mark  Sense 
cards  to  put  data  on  punched  cards.  They  noted  the  procedure  avoids  the 
expense  and  difficulty  of  coding  and  keypunching  large  volumes  of  raw  data. 
Of  course,  the  use  of  Mark  Sense  cards  has  been  largely  replaced  by  one 
of a number of optical scanning procedures allowing the processing of
regular  sized  answer  sheets  and  booklets. 

Lyman  (1949)  examined  the  assumption  that  items  in  a multi-scale 
inventory  should  be  scrambled,  even  when  the  items  are  "obvious."  He 
compared scrambled items and items blocked according to scale in a school
attitude survey. Two high school senior classes were given the tests,
one half of each taking the alternate version which was followed two
weeks later by the other version. Test scores revealed no statistically
significant differences, leading the author to conclude that blocking
items  may  be  preferable  due  to  its  greater  ease  of  scoring. 

Other  considerations  related  to  scoring  questionnaires.  Methods  of 
scoring  questionnaires,  especially  attitude  scales,  were  discussed  by  a 
number  of  authors.  For  example,  Kundu  (19t0)  suggested  a method  for 
scoring responses on three point attitude scales. Assuming a non-normal
distribution  of  attitude  scores  and  non-neutral  trends  in  attitudes,  the 
"neutral"  responses  are  broken  into  positive  or  negative  and  the  responses 
are  scored  with  the  help  of  average  group  trends  and  weighted  scores  of 
the individual responses. Peabody reported (1962) that there is a
justification for scoring items dichotomously according to the direction
of  response,  as  is  done  when  bipolar  scales  are  analyzed  in  terms  of  the 
proportion of responses in each direction of the basic dichotomy. The
justification  is  based  upon  results  he  obtained  that  indicated  composite 
scores  reflect  primarily  the  direction  of  responses,  and  only  to  a minor 
extent  their  extremeness.  (He  also  noted  that  since  extremeness  scores 
are  reliable  and  are  largely  unreflected  in  the  usual  composite  score, 
they  may  have  quite  different  correlates  of  their  own.)  Matell  (1970), 
who  investigated  the  psychometric  characteristics  of  Likert-type  rating 
scales consisting of two through 19 steps, found that, by collapsing the
steps into two or three measurement categories for analysis, no lack of
precision resulted. Schuessler (1952), however, raised doubt about the
validity  of  combining  response  categories  in  successive  approximations  of 
scalability.  He  showed  irregularities  between  analyses  of  a questionnaire 
form  in  which  an  "uncertain"  response  was  permitted  and  combined  as  an 
approximation,  and  a second  questionnaire  form  in  which  the  "uncertain" 
response was not permitted. Odesky (1967), working with paired comparisons
with a no preference option, suggested the advisability of either
dividing  no  preference  responses  proportionate  to  preference  responses, 
or  disregarding  them  altogether.  The  basis  of  this  suggestion  was  that 
respondents  who  claim  neutrality  appear  to  exhibit  the  same  preference 
patterns  as  those  who  express  a preference. 
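
Odesky's two options can be illustrated with a minimal sketch. The function
names and the vote counts below are hypothetical and are not taken from the
study; the sketch only shows that dividing no preference responses in
proportion to the expressed preferences leaves the preference share
unchanged, so both options lead to the same conclusion.

```python
def allocate_no_preference(prefer_a, prefer_b, no_preference):
    """Split no-preference responses between A and B in proportion
    to the preferences that were actually expressed."""
    expressed = prefer_a + prefer_b
    if expressed == 0:
        raise ValueError("no expressed preferences to apportion against")
    share_a = prefer_a / expressed
    adjusted_a = prefer_a + no_preference * share_a
    adjusted_b = prefer_b + no_preference * (1 - share_a)
    return adjusted_a, adjusted_b


def share_for_a(count_a, count_b):
    """Proportion of counted responses that favor A."""
    return count_a / (count_a + count_b)


# Hypothetical tallies: 60 prefer A, 20 prefer B, 20 state no preference.
adjusted_a, adjusted_b = allocate_no_preference(60, 20, 20)

# Option 1: apportion the no-preference responses proportionally.
share_apportioned = share_for_a(adjusted_a, adjusted_b)
# Option 2: disregard the no-preference responses altogether.
share_disregarded = share_for_a(60, 20)
```

In this sketch both shares come out the same, which is the practical
meaning of treating neutral respondents as having the same preference
pattern as those who state a preference.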

Two  methods  of  scoring  rating  scale  data  to  approximate  forced 
choice  results  were  reported  by  Karr  (1959a;  1959b).  One  was  called  the 
difference  method,  and  was  designed  to  have  maximum  stability.  The  other, 
called  the  zero-one  method,  was  designed  to  match  as  closely  as  possible 
the  method  of  scoring  for  the  forced  choice  format  personality  inventory 
with  which  the  scoring  methods  were  being  compared.  Karr  concluded  that 
by  using  any  one  of  several  methods  of  scoring  or  transforming  self-rating 
scale  raw  scores,  it  is  possible  to  approximate  dyadic  forced  choice 
results with considerable saving in administration time, and a small gain
in  test-retest  reliability. 

It  was  hypothesized  by  Schaie  (1963)  that  the  concurrent  validity  of 
questionnaires  can  be  increased  by  the  use  of  item  weights  obtained  by 
expert scaling, instead of by using conventional unit weights. The results,
using  a high  school  personality  quiz,  showed  only  low  magnitude  increments 
in  validity,  however. 


Several  authors  reported  on  the  use  of  intensity  scores  as  distinguished 
from content scores. Guttman (1947a) showed how intensity scores can be
obtained  by  either  the  fold-over  technique  or  the  two-part  technique.  The 
fold-over  technique  involved  weighting  extreme  responses  (positive  and 
negative)  as  2,  moderate  responses  as  1,  and  neutral  responses  as  0,  and 
summing  these  for  an  intensity  score.  The  two-part  technique  ascertains 
an  intensity  score  simply  by  following  each  question  with  the  query  "How 
strongly do you feel about this?" which is also answered on a scale.
Goldsamt (1972), working with content-free stimuli, however, concluded
that dichotomous scoring methods are equivalent to intensity scoring methods.
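
The fold-over weighting described above can be sketched as follows. The
sketch assumes a five-point response scale coded 1 through 5 with 3 as the
neutral point; that coding, the names, and the sample responses are
illustrative assumptions and do not come from Guttman's report.

```python
# Fold-over weights for an assumed five-point scale coded 1..5:
# extreme responses (1, 5) count 2, moderate responses (2, 4) count 1,
# and the neutral response (3) counts 0.
FOLD_OVER_WEIGHTS = {1: 2, 2: 1, 3: 0, 4: 1, 5: 2}


def content_score(responses):
    """Conventional content score: sum of the coded responses."""
    return sum(responses)


def intensity_score(responses):
    """Fold-over intensity score: sum of the folded weights,
    ignoring the direction of each response."""
    return sum(FOLD_OVER_WEIGHTS[r] for r in responses)


# Two hypothetical respondents with the same content score.
mild = [3, 3, 3, 3]
polarized = [5, 1, 5, 1]

mild_scores = (content_score(mild), intensity_score(mild))
polarized_scores = (content_score(polarized), intensity_score(polarized))
```

The two respondents share a content score but differ in intensity, which is
the distinction the fold-over technique is meant to capture.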




A rating-scoring technique for evaluating free response answers
was  described  and  illustrated  by  Canter  (1953).  The  technique  involved 
using  raters  to  sort  responses  to  each  of  four  questions  into  a seven 
category  forced  normal  distribution.  The  score  for  each  respondent  was 
the  sum  of  the  category  numbers  assigned  by  the  raters  over  the  four 
questions.  Interrater  reliabilities  of  .85  and  .88  were  reported. 

A procedure  for  correcting  the  influence  of  social  desirability 
(SD) response set in opinion research was reported by Smith (1967).
Several SD items, to which it is assumed the true response is known
(e.g.,  "Do  you  like  everyone  you  meet?")  are  included  in  the  questionnaire. 
The  SD  score  from  these  items  is  then  correlated  with  each  of  the  other 
items  on  the  inventory.  The  responses  on  those  items  with  a statistically 
significant  correlation  can  then  be  corrected,  by  moving  the  response  one 
or  more  steps  from  the  socially  desirable  response,  to  give  a more 
accurate  result. 


Properties  and  Uses  of  Ipsative  Scores

During  the  literature  review,  some  attention  was  paid  (but  not  an 
exhaustive  review)  to  the  topic  of  ipsative  scores  since  their  properties 
are not well known but should be understood by those who design questionnaires.
Cattell (1944) noted that psychological
measurement could be expressed in three kinds of units: interactive,
normative,  and  ipsative.  Interactive  units  are  exemplified  by  the  typical 
"raw"  units  of  psychology,  where  one  is  measuring  the  interaction  of  the 
individual  and  his  environment.  Interactive  units  are  neither  dependent 
upon  any  other  scores  of  the  individual  measured  nor  upon  the  scores  of 
other  individuals.  Normative  units  are  interactive  measurements  relative 
to  a group  of  persons,  or  in  terms  of  a population  of  measurements  provided 
by  a population  of  persons.  Hence,  the  score  of  any  given  individual  is 
dependent  upon  the  scores  of  others  in  the  population.  Most  scores  in 
behavioral  measurement  are  of  this  type.  Ipsative  units,  on  the  other  hand, 
are  interactive  measurements  in  terms  of  a population  of  measurements 
within  an  individual.  Hence,  the  score  for  an  individual  on  a variable 
is dependent upon his scores on other variables.

Several derivations of the three forms given above are possible.
Where interactive measures are first scored normatively and then scored
ipsatively, "normative ipsative" units are produced. "Ipsative normative"
scores are also obtainable when ipsative units are themselves treated
normatively. As Cattell (1944) pointed out, however, ipsative normative
scores are not the same as normative ipsative scores.

Ipsative scores can be obtained in one of two ways: arithmetically
through the use of various scaling procedures; and experimentally. Experimentally
ipsative scores are produced, for example, by the forced choice
technique, the use of paired comparisons, and the Q-sort. Examples of well-known
tests that produce ipsative scores are the Allport-Vernon Study of
Values, the Edwards Personal Preference Schedule, and the Kuder Preference
Record. A set of attribute measures is defined as ipsative when the sum
of scores over all attributes is a constant for each entity.
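
A minimal sketch of arithmetic ipsatization may help fix the definition.
It assumes one common procedure, subtracting each person's own mean from
each of that person's scores, so that every person's scores sum to the same
constant (zero). The text does not say which scaling procedure any given
author used, and the names and scores below are hypothetical.

```python
def ipsatize(scores):
    """Express each score as a deviation from the person's own mean.

    After this step the scores of every person sum to zero, which
    satisfies the definition of an ipsative set (constant sum).
    """
    person_mean = sum(scores) / len(scores)
    return [s - person_mean for s in scores]


# Hypothetical absolute scores on three attributes for two persons.
person_low = [10, 20, 30]
person_high = [50, 60, 70]

ipsative_low = ipsatize(person_low)
ipsative_high = ipsatize(person_high)
```

Both persons receive identical ipsative profiles even though the second
person is higher on every attribute in absolute terms. The profile shows
only the relative standing of attributes within a person.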

The  properties  of  ipsative  measures  were  investigated  by  Clemans 
in  1956  and  reported  in  a Psychometric  Monograph  in  1965.  Some  of  his 
major findings, based on arithmetically ipsatized scores, and related
implications  and  recommendations  are  given  in  detail  below  since,  as 
previously  noted,  the  properties  of  ipsative  scores  are  still  not  well 
known. 

1.  There  is  always  a set  of  raw  or  absolute  measures  underlying  an 
ipsative  set.  They  may  be  very  difficult  or  impossible  to  obtain,  but  in 
theory  they  are  there. 

2.  There  will  be  a large  number  of  negative  values  in  any  ipsative 
correlation  matrix. 

3.  Ipsative  intercorrelation  matrices  are  nonbasic  or  singular  and 
thus  have  no  regular  inverse.  Hence,  if  regression  weights  are  to  be 
determined  for  a complete  set  of  ipsative  variables,  special  procedures 
(such as iterative techniques) will have to be employed or one of the
ipsative  variables  will  have  to  be  deleted. 

4.  The  least-square  estimate  of  a criterion  using  all  the  variables 
of  an  ipsative  set  is  identical  with  the  least-square  solution  with  any 
single  variable  deleted,  regardless  of  the  validity  coefficient  of  that 
variable. 

5.  Ipsative  scores  must  always  be  interpreted  as  relative  and  not 
absolute  measures.  Ipsative  variables  are  highly  interdependent. 

6.  Except  in  one  special  case,  the  ipsative  multiple  correlation  is 
always  less  than  the  multiple  correlation  for  the  same  variables  prior  to 
ipsatizing.  Whenever  possible  then,  absolute  measures  should  be  used  for 
measuring  attributes  of  behavior. 

7.  If  the  underlying  absolute  measures  have  zero  correlation  with 
each  other,  or  they  all  correlate  to  some  constant  degree,  the  ipsative 
intercorrelations  will  all  be  a negative  constant  value  determined  only 
by  the  number  of  variables.  This  again  shows  the  high  interdependence. 

8.  Under  certain  restrictions,  the  ipsative  covariance  matrix  and 
the  first  centroid  residual  of  the  absolute  measures  are  identical,  and 
the  property  seems  to  hold  very  well  even  without  the  restrictions.  This 
is  the  same  as  stating  that  a tremendous  amount  of  information  is  missing 
in  the  ipsative  set.  The  fact  that  this  information  is  missing  from  the 
ipsative set will make it next to impossible to make anything psychologically
meaningful out of a factor analysis of such data, other than a
determination  of  the  rank  of  the  matrix. 


9.  If  the  means  and  variances  of  a set  of  scores  were  not  equated 
prior to ipsatizing, the resulting scores will have little meaning. Although
the  means  can  be  adjusted  after  ipsatizing,  the  variances  cannot.  Hence, 
the  ipsative  test  maker  must  be  as  certain  as  he  can  that  equal  variances 
are  maintained  for  the  absolute  scales  underlying  an  ipsative  set  even 
though  the  absolute  measures  cannot  be  observed.  How  this  is  to  be  accom- 
plished is  unknown. 

10.  The  magnitude  of  ipsative  scores  must  never  be  confused  with  the 
magnitude of the absolute measures for the same set of variables. It is
quite  possible  that  a person  with  a low  ipsative  score  on  a particular 
trait  may  actually  possess  more  of  the  trait  than  a person  having  a much 
higher  ipsative  score. 

11.  Although  nonipsative  measures  contain  more  information  than 
ipsative  measures,  it  is  not  usually  an  easy  task  to  develop  absolute 
measures  that  correspond  to  the  variables  in  an  ipsative  set.  It  was  the 
difficulty of obtaining valid absolute measures that led to the development
of some of the available ipsative instruments, such as attempts to
eliminate  the  social  desirability  factor.  Hence,  some  traits  that  may  be 
relatively  easily  compared  using  ipsative  techniques  may  be  very  difficult 
to assess validly using instruments designed to yield more direct or
absolute  measures. 
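
Findings 3 and 7 above can be checked numerically with a small sketch. It
assumes mean-deviation ipsatization and uses a hypothetical data set of
eight entities scored -1 or +1 on three attributes in every combination, so
that the absolute measures are exactly uncorrelated and have equal
variances. Under those assumptions the standard result is that every
ipsative intercorrelation equals -1/(m - 1) for m variables, here -0.5. That
formula is a known consequence of the constant-sum constraint and is not
quoted from Clemans.

```python
from itertools import product
from math import sqrt


def ipsatize(scores):
    """Deviation of each score from the entity's own mean."""
    mean = sum(scores) / len(scores)
    return [s - mean for s in scores]


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def correlation_matrix(rows):
    """Intercorrelations among the columns (variables) of rows (entities)."""
    columns = list(zip(*rows))
    return [[pearson(a, b) for b in columns] for a in columns]


def det3(m):
    """Determinant of a 3 x 3 matrix by cofactor expansion."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


# Hypothetical absolute measures: all 8 combinations of -1/+1 on 3 attributes.
absolute = [list(combo) for combo in product((-1, 1), repeat=3)]
ipsative = [ipsatize(row) for row in absolute]

r_absolute = correlation_matrix(absolute)   # off-diagonals are zero
r_ipsative = correlation_matrix(ipsative)   # off-diagonals are -1/(3 - 1)
determinant = det3(r_ipsative)              # zero: the matrix is singular
```

The zero determinant is the numerical counterpart of finding 3: the
ipsative correlation matrix has no regular inverse, so regression weights
for the full set cannot be obtained by ordinary matrix inversion.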

Seven  other  studies  were  reviewed  that  also  investigated  the  properties 
and  uses  of  ipsative  scores.  Block  (1957)  reported  the  results  of  two 
studies  in  which  ipsative  ratings,  treated  normatively,  were  correlated 
with  corresponding  normative  ratings  in  a test  of  the  functional  equivalence 
of  the  two  forms  of  measurement.  Both  of  the  analyses  showed  an  almost 
complete  equivalence  between  the  two  methods.  Some  of  the  advantages  of 
the  ipsative  approach  were  also  presented. 

Horst  and  Wright  (1959)  reported  on  the  comparative  reliability  of 
an arithmetically ipsatized rating scale and its experimentally ipsative
counterpart.  The  experimentally  ipsative  scores  were  obtained  from  the 
forced  choice  format  Edwards  Personal  Preference  Schedule,  the  individual 
items  of  which  were  also  administered  in  rating  scale  form  to  obtain  inter- 
active scores.  The  interactive  scores  were  standardized  by  variable  to 
produce  normative  scores,  and  these  were  then  arithmetically  ipsatized  over 
persons  to  produce  normative  ipsative  measures.  It  was  found  that  the 
average  reliability  of  the  variables  for  the  arithmetically  ipsatized 
rating  scale  form  was  .87,  while  for  the  experimentally  ipsative  scale 
it  was  .78,  even  though  the  administration  time  for  the  rating  scale  was 
only about one-third that of the EPPS. They concluded that any advantage
which  the  forced  choice  type  of  self-appraisal  instrument  may  have  over  the 
arithmetically  ipsatized  rating  scale  must  be  other  than  that  of  greater 
reliability.  They  also  suggested  that  other  possible  advantages  be 
investigated  in  further  research.  In  a related  report,  Wright  (1961) 
compared the three types of measures with respect to their intercorrelations
and factor structures. The data suggested that the normative approach
provided  more  significant  measures  since  one  less  factor  was  required  to 
extract  all  of  the  approximate  reliable  variance  from  the  intercorrelations 
of  the  experimentally  ipsative  units  than  from  the  normative  and  arith- 
metically ipsative  scores.  An  inspection  of  the  unrotated  factor  patterns 
showed  that  the  first  normative  units  factor  was  not  adequately  matched  by 
either  a normative  ipsative  or  an  experimentally  ipsative  factor.  It  was 
tentatively  identified  as  a factor  of  social  desirability,  the  attempted 
minimization  of  which  was  the  reason  the  EPPS  was  developed  in  forced 
choice  format. 
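
The sequence Horst and Wright describe, interactive scores standardized by
variable and then ipsatized within each person, can be sketched as follows.
This is an illustration under stated assumptions and not their
computational procedure: the ratings are hypothetical, standardization uses
the population standard deviation, and ipsatization subtracts each person's
mean standard score.

```python
from statistics import mean, pstdev


def standardize_by_variable(rows):
    """Convert interactive (raw) scores to normative scores.

    Each column (variable) is rescaled to mean 0 and standard
    deviation 1 across persons. Assumes no column is constant.
    """
    columns = list(zip(*rows))
    stats = [(mean(col), pstdev(col)) for col in columns]
    return [[(value - m) / s for value, (m, s) in zip(row, stats)]
            for row in rows]


def ipsatize_by_person(rows):
    """Subtract each person's own mean from that person's scores."""
    result = []
    for row in rows:
        person_mean = mean(row)
        result.append([value - person_mean for value in row])
    return result


# Hypothetical rating-scale (interactive) scores: 3 persons x 3 variables.
interactive = [
    [7, 3, 5],
    [4, 6, 2],
    [1, 9, 8],
]

normative = standardize_by_variable(interactive)
normative_ipsative = ipsatize_by_person(normative)
```

The order of the two steps matters. Standardizing first equates the means
and variances of the variables before ipsatizing, which is the condition
Clemans' ninth finding requires for the resulting scores to be meaningful.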

The equivalence of ipsative and normative personality measures was also
studied  by  Heilbrun  (1963)  with  regard  to  interscale  correlation  and  relative 
validities. Using normative check lists and forced choice ipsative Q-sorts,
the  results  were  interpreted  as  supporting  the  use  of  ipsative  measures  for 
normative  predictions.  Concerning  reliability,  Tenopyr  (1968)  noted  that 
the  practice  of  resorting  to  stability  coefficients  as  reliability  estimates 
for  ipsative  scores  is  not  a satisfactory  method  by  itself  since  these 
coefficients  are  subject  to  scale  interdependency.  He  suggested  that  the 
recommended  practice  for  establishing  the  reliability  of  ipsative  inventory 
scales  should  involve  the  establishment  of  internal  consistency  and  stability 
for  the  scales  prior  to  putting  them  into  the  forced  choice  form. 

Two  reports  mentioned  the  "degree  of  ipsativity."  Smith  (undated,  but 
around  1965)  reviewed  the  relevant  literature  describing  the  mathematical 
and  empirical  properties  of  ipsative  and  nonipsative  measures.  The  review 
led  to  the  explication  of  a simple  procedure  for  quantifying  the  "degree 
of  ipsativity"  in  psychological  measurement  instruments.  After  evaluating 
several  published  research  studies  against  the  index,  he  concluded  that 
purely ipsative test instruments possess such extensive psychometric and
statistical  limitations  that  utilization  of  such  instruments  is  not  advisable. 
Hicks  (1970)  came  to  the  same  conclusion  in  what  could  be  a later  report 
on  the  same  study.  He  went  on  to  suggest,  however,  that  ipsative  tests 
should  be  used  only  in  situations  where  it  has  been  demonstrated  that: 
significant  response  bias  exists;  this  bias  reduces  validity;  and  an 
ipsative format successfully reduces bias and increases validity to a
greater  extent  than  do  nonipsative  controls  for  bias.  Since  Hicks  felt 
that  little  of  the  research  utilizing  ipsative  measures  fulfilled  these 
requirements,  he  believed  that  it  is  necessary  to  reevaluate  thoroughly 
the extensive body of research that has used purely ipsative forced choice
tests and that has employed statistical techniques predicated upon assumptions
which such instruments necessarily violate. It may be noted that
the  conclusions  of  both  Smith  and  Hicks  are  somewhat  more  extreme  than  those 
of  Clemans  (1965). 


Data  Analyses 


Generally, reports on data analyses were beyond the scope of this
review unless specifically connected with some aspect of questionnaire
construction. Most that were located were discussed above in other chapters
in conjunction with their related topic. Four articles, however, may be
noted here.

Stevens (1946) pointed out four kinds of measurement scales:
nominal,  ordinal,  interval,  and  ratio.  Appropriate  statistical  analyses 
are  associated  with  each.  Hence,  the  data  analysis  limitations  of  various 
forms  of  questionnaires  should  be  considered  before  an  instrument  is 
designed. For example, from a power-of-the-statistic point of view less
can  be  done  with  open-ended  questions  than  with  ranking  questions. 


A statistical  measure  most  appropriately  used  in  conjunction  with 
the  method  of  paired  comparisons  was  reported  by  Balinsky,  Blum,  and 
Dutka  (1951).  Called  the  coefficient  of  agreement,  it  enables  the 
experimenter to measure the degree and test the significance of agreement
among  observers  as  to  their  preferences  for  a series  of  items  offered  for 
consideration.  It  can  be  readily  used  in  the  construction  and  testing 
of  attitude  and  opinion  scales. 
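
The text does not give the formula for the coefficient of agreement. The
sketch below assumes it is the coefficient u defined by Kendall and
Babington Smith for paired comparisons, u = 2S / (C(m,2) C(n,2)) - 1, where
m is the number of observers, n the number of items, and S the sum over
ordered item pairs of C(a, 2), with a the number of observers preferring
the first item to the second. The data layout and all names are
hypothetical, every observer is assumed to judge every pair without ties,
and the significance test is not shown.

```python
from itertools import combinations
from math import comb


def coefficient_of_agreement(judgments, items):
    """Kendall and Babington Smith's u for paired-comparison data.

    judgments: one set per observer containing (preferred, other)
               tuples, one tuple for every pair of items.
    items:     the items that were compared.
    Returns 1.0 when all observers agree on every pair.
    """
    m = len(judgments)
    n = len(items)
    if m < 2 or n < 2:
        raise ValueError("need at least two observers and two items")
    agreement_sum = 0
    for first, second in combinations(items, 2):
        first_preferred = sum(1 for choices in judgments
                              if (first, second) in choices)
        second_preferred = m - first_preferred
        # Count the pairs of observers who made the same choice.
        agreement_sum += comb(first_preferred, 2) + comb(second_preferred, 2)
    return 2 * agreement_sum / (comb(m, 2) * comb(n, 2)) - 1


items = ["A", "B", "C"]

# Three hypothetical observers; the third reverses the B versus C choice.
observer_1 = {("A", "B"), ("A", "C"), ("B", "C")}
observer_2 = {("A", "B"), ("A", "C"), ("B", "C")}
observer_3 = {("A", "B"), ("A", "C"), ("C", "B")}

u_partial = coefficient_of_agreement([observer_1, observer_2, observer_3], items)
u_complete = coefficient_of_agreement([observer_1, observer_2], items)
```

With three observers and one reversed choice the coefficient is 5/9; with
complete agreement it is 1.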


Litwak (1956) points out that ad hoc rules on question wording can
be  systematically  defined  by  the  constraints  of  latent  structure  analysis. 
And  Reynolds  (1966)  attempts  to  determine  the  degree  of  difference  between 
two  ratings  required  for  statistical  significance  with  samples  of  varying 
sizes. 




Chapter  XII 

RECOMMENDED  AREAS  FOR  FURTHER  RESEARCH


This  chapter  contains  recommendations  for  further  research  based 
upon  a lack  of  empirical  research  or  contradictions  in  results  of  the 
studies reviewed in the previous chapters. The section headings used
correspond  to  the  previous  chapters. 


Advantages  and  Disadvantages  of  Various  Types  of  Questionnaires 

1.  Because of the lack of stress in this review on mail questionnaires,
only a few articles were discussed in Chapter II. Additional
information could probably be found by extending the
literature  search. 

2.  More  research  appears  to  be  needed  on  the  benefits,  validity,  and 
reliability  of  combinations  of  questionnaire  methods,  for  example 
interview  and  self-administered  questionnaires. 

Selection  of  Questionnaire  Items  to  Be  Used 

1.  More  research  appears  to  be  needed  on  the  comparison  of  ranking  and 
rating techniques. For example, there is some evidence that conclusions
based upon a single judge differ from those based upon
multiple  judges.  Also,  more  studies  need  to  be  designed  where  the 
items  to  be  ranked  or  rated  are  as  comparable  as  possible. 

2.  Contradictory evidence was obtained regarding the comparison of
ranking  and  paired  comparison,  which  suggests  further  research. 

3.  More  studies  need  to  be  conducted  on  the  comparison  of  rating 
scales  and  forced  choice  items,  where  identical  items  are  used 
in  both  forms. 

4.  Since  few  studies  were  located  on  the  comparison  of  rating  scales 
and  card  sorts,  rating  scales  and  semantic  differential  items,  and 
rating  scales  and  check  lists,  more  studies  can  be  carried  out  in 
these  areas. 

5.  More  research  is  needed  on  the  comparison  of  multiple  choice  items 
with  other  item  types. 

6.  A more  critical  and  detailed  review  is  needed  regarding  issues 
related to forced choice and paired comparison items.




7.  A more extensive literature review regarding the use of the semantic
differential might be in order.

8.  Because of the focus of this study, few articles were located on
card  sorts  and  projective  items,  suggesting  that  a more  complete 
literature  review  could  be  conducted. 

9.  Few  studies  concerning  check  lists,  open-ended  items,  rearrangement 
items,  and  matching  items  were  uncovered,  suggesting  another  possi- 
ble area  of  additional  research. 

Comparison  of  Scaling  Techniques 

1.  This review did not stress scaling techniques on which many articles
have  been  written.  It  is  suggested  that  a review  of  the  literature 
could  be  done  stressing  just  scaling  techniques  in  regard  to  question- 
naire construction. 

Effects  of  Variation  in  Presentation  of  Questionnaire  Items 

1.  Even  though  the  review  of  the  literature  revealed  that  pictures 
can  be  effectively  employed  in  questionnaires,  the  review  should 
be  extended  to  determine  other  modes  of  item  presentation  that  can 
be used in questionnaires (e.g., use of tape recorders or physical
objects).

2.  Follow-up  research  is  warranted  in  the  area  of  question  stem  word- 
ing. Many  important  issues  have  been  raised  by  the  studies  presen- 
ted, but  there  has  been  little  systematic  pursuit  of  the  issues  to 
a conclusion. 

3.  Since  no  research  studies  were  uncovered  which  examined  the  wording 
of  response  alternatives,  research  needs  to  be  done  in  this  area. 

4.  More  attention  has  been  devoted  to  measures  of  item  difficulty  than 
to  the  effects  of  item  difficulty  on  questionnaire  responses. 
Additional  attention  needs  to  be  focused  on  item  difficulty  response 
tendencies  such  as  acquiescence,  "don't  know,"  and  "no  responses." 

5.  The effect of the length of a question stem is an under-researched
area.  Studies  should  be  conducted  to  the  point  where  generalized 
conclusions  can  be  made. 

6.  Experiments  controlled  for  subjects'  characteristics,  topical  area, 
scale  length  and  instrument  should  be  conducted  to  determine  the 
effects  of  the  order  of  response  alternatives. 

7.  No  research  was  uncovered  relative  to  adjective  location  in  the  stem 
of  the  question  versus  adjective  location  in  the  response  alterna- 
tive. Such  research  needs  to  be  done. 


Number  of  Response  Alternatives  and  Response  Anchoring


1.  Additional  research  in  the  area  of  the  optimal  number  of  response 
alternatives  to  use  is  warranted.  This  research  should  cover: 
the  different  types  of  rating  scales;  various  topical  areas  of 
research;  and  subjects  with  different  ability,  educational  and 
sociodemographic characteristics. From such research, information
would  be  available  regarding  the  optimal  number  of  response  alter- 
natives to  employ  for  any  specific  type  of  investigation  situation. 

2.  Additional  work  needs  to  be  done  on  the  use  of  balanced  versus  un- 
balanced scales. 

Order  of  Perceived  Favorableness  of  Commonly  Used  Words  and  Phrases

1.  Even  though  extensive  work  has  been  done  on  the  order  of  perceived 
favorableness  of  commonly  used  words  and  phrases,  individual  in- 
vestigators may  want  to  determine  the  order  of  perceived  favorable- 
ness of  words  which  are  not  included  in  the  lists  in  Chapter  VII 
and  which  are  commonly  used. 

Considerations  Related  to  the  Physical  Characteristics  of  Questionnaires 

1.  Research  needs  to  be  conducted  in  regard  to  the  location  of  response 
alternatives  relative  to  the  question  stem. 

2.  Additional  studies  need  to  be  carried  out  to  determine  the  effect 
that  the  length  of  a questionnaire  has  on  both  the  respondents' 
motivation and on the return of mailed questionnaires.

3.  Another possible area of research would be to determine the relations
of questionnaire length to response consistency and validity.

4.  Systematic  research  needs  to  be  done  on  the  physical  appearance  of 
questionnaires  including  type  size,  spacing,  color,  type  of  paper 
and  the  use  of  pictures. 

Considerations  Related  to  the  Administration  of  Questionnaires 

1.  More systematic research is needed to determine the range of variations
in instructions that may affect the results obtained from
questionnaires and on the effects of variations in respondent understanding
of instructions.

2. Further research is needed on the effects of administration time on
subjects' motivation, and on the effects of setting time limits
for completing questionnaires.

3. No studies were uncovered that were concerned with the effects of
the administrators of questionnaires in the military setting.
For example, the military rank of the person administering a
questionnaire may have an effect, as might whether the administrator
is in the military or not.




4.  It  is  apparent  that  additional  research  is  needed  on  the  effects 
of  administrative  conditions.  Such  research  should  include  the 
study  of  fatigue  factors. 

Characteristics  of  Respondents  that  Influence  Questionnaire  Results 

1. Controls for all types of response bias need research.

2. The question "Are attitudinal bias and characteristics bias
automatically eliminated by stringent sampling controls, or must
each instrument take them into account?" needs to be resolved.

Considerations  Related  to  the  Evaluation  of  Questionnaire  Results 

1.  A more  extensive  review  should  be  made  of  work  related  to  the  prop- 
erties and  uses  of  ipsative  scores  and  research  should  be  under- 
taken to  fill  the  gaps  since  procedures  and  techniques  producing 
such  scores  are  in  wide  use. 

2.  The  literature  review  could  be  expanded  to  include  scoring  and 
data  analysis,  as  related  to  questionnaire  construction. 

General  Recommendations 


1.  The  literature  review  could  be  expanded  to  cover  citations  that 
were  not  abstracted. 

2.  The  present  bibliography  could  be  refined,  maintained,  and  updated. 
Possibly  it  could  be  computerized  so  that  requests  for  needed  in- 
formation could  quickly  be  answered. 

3.  The  present  literature  review  could  be  reviewed  by  senior  consul- 
tants in  the  field  and  expanded  or  modified  on  the  basis  of  their 
suggestions.

4.  Many  conclusions  presented  in  this  review  could  be  tested  in  relation 
to  the  military  situation. 

5.  An  attempt  could  be  made  to  collect  data  about  relevant  issues  on 
questionnaire  construction  from  groups  who  routinely  administer 
questionnaires  but  who  might  not  publish  their  findings. 




BIBLIOGRAPHY 


Over  500  references  are  cited  in  this  report.  These  references  are 
asterisked  in  the  following  bibliography  of  over  2,000  citations.  For 
those  citations  for  which  abstracts  were  prepared  or  were  available, 
the  information  explained  below  is  also  presented: 

1. The first code at the top center of each citation indicates the
perceived relevance (H for high, M for moderate, N for negligible, NA
for those found to be not applicable) of the citation, as estimated from
the title (T), abstract (A), or report (R).


2.  The  second  code  at  the  top  right  of  each  citation  indicates 
the  technical  categories  which  the  citation  appeared  to  address,  from  the 
following  list: 

1.  Type  of  Instrument 

2.  Response  Form 

3a.  Number  of  Response  Alternatives 

3b.  Order  of  Response  Alternatives 

3c.  Order  of  Questions 

3d.  Adjectives  in  Stem  vs.  in  Response  Alternatives 

3e.  Location  of  Response  Alternatives  Relative  to  Stems 

3f.  Response  Anchoring 

3g.  Miscellaneous  Format  Considerations 

4.  Clarity 

5.  Instrument  Length 

6.  Perceived  Favorableness  of  Adjectives 

7.  Administration  of  Questionnaires 

8.  Evaluation  of  Questionnaires 

9.  Personnel  Characteristics 

10.  Personnel  Attitudes 

11.  Motivation 

12.  Bias 

13.  Other  Sources  of  Error 

14.  Questionnaire  Development  Processes 

15.  Miscellany 

16.  Reference  Source 

17.  Questionnaire  Construction  Guidelines 

18.  Found  to  be  Not  Applicable 

3.  Next  the  citation  is  given,  in  standard  American  Psychological 
Association  format. 

4. Following the citation itself are selected descriptive words
or phrases further identifying the subject of the citation.

5.  Finally,  the  source  of  the  abstract  is  noted. 
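
For readers who want to tabulate the annotated entries, the coding scheme above can be expressed as a small lookup. The following Python sketch is illustrative only and is not part of the original report. The function names are hypothetical, and it assumes (from the way annotated entries appear, e.g. "R-M") that the relevance code is written as the basis letter, a hyphen, and then the relevance letter or letters.

```python
# Illustrative decoder for the bibliography annotation codes.
# Assumption: a code such as "R-M" reads "<basis>-<relevance>".

RELEVANCE = {"H": "high", "M": "moderate", "N": "negligible", "NA": "not applicable"}
BASIS = {"T": "title", "A": "abstract", "R": "report"}

# Technical categories, copied from the numbered list in the text.
CATEGORIES = {
    "1": "Type of Instrument",
    "2": "Response Form",
    "3a": "Number of Response Alternatives",
    "3b": "Order of Response Alternatives",
    "3c": "Order of Questions",
    "3d": "Adjectives in Stem vs. in Response Alternatives",
    "3e": "Location of Response Alternatives Relative to Stems",
    "3f": "Response Anchoring",
    "3g": "Miscellaneous Format Considerations",
    "4": "Clarity",
    "5": "Instrument Length",
    "6": "Perceived Favorableness of Adjectives",
    "7": "Administration of Questionnaires",
    "8": "Evaluation of Questionnaires",
    "9": "Personnel Characteristics",
    "10": "Personnel Attitudes",
    "11": "Motivation",
    "12": "Bias",
    "13": "Other Sources of Error",
    "14": "Questionnaire Development Processes",
    "15": "Miscellany",
    "16": "Reference Source",
    "17": "Questionnaire Construction Guidelines",
    "18": "Found to be Not Applicable",
}


def decode_relevance(code):
    """Split a code like 'R-M' into (basis of estimate, perceived relevance)."""
    basis_key, _, relevance_key = code.strip().partition("-")
    return BASIS[basis_key], RELEVANCE[relevance_key]


def decode_categories(codes):
    """Expand a comma-separated category list like '3a, 5, 8'."""
    return [CATEGORIES[c.strip()] for c in codes.split(",") if c.strip()]


if __name__ == "__main__":
    print(decode_relevance("R-M"))
    print(decode_categories("3a, 5, 8"))
```

A code that does not match the scheme raises a KeyError, which is a convenient way to flag entries whose codes were damaged in reproduction.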




Abbott, K. D. Stylistic response set variance and trait inference from the
study of values. Psychological Reports, 1970, 27, 911-914.

Response Bias  12

Psychological Reports, 27, p. 911 (Rev. from rept.)  R-M


Abeles,  H.  F.  A facet-factorial  approach  to  the  construction  of  rating 
scales  to  measure  complex  behavior.  Journal  of  Educational  Measurement.  1973, 
10(2), 143-151.


Abelson, R. P. A technique and a model for multi-dimensional attitude scaling.
Public Opinion Quarterly, 1954, 18, 405-418.


Abrahams,  N.  M.  & Lacey,  L.  A.  The  Navy  Adjective  List  as  a predictor  of 
enlisted retention. U.S. Naval Personnel and Training Research Laboratory,
1972. Research Memorandum 73-2-1-7.

Personality Measures  18

Psych. Abst., 49, #12129  R-N


Abrams,  J.  An  evaluation  of  alternative  rating  devices  for  consumer  research. 
Journal of Marketing Research, 1966, 3, 189-193.

Instrument  Format,  Multiple  Choice  Items,  Rating  Scales,  3f 
Response  Alternatives,  Validity 


Abul-Ela, A.-L. A., Greenberg, B. G., & Horvitz, D. G. A multi-proportions
randomized  response  model.  Journal  of  the  American  Statistical  Association. 
1967,  62,  990-1008. 


* Ace, M. E., & Dawis, R. V. The contributions of questionnaire length, format,
and type of response inconsistency. Educational and Psychological
Measurement, 1972, 32(4), 103-111.

Instrument Length, Instrument Format, Investigator Error, Scoring  3a, 5, 8

ORA  R-M


Ace,  M.  C.,  & Dawis,  R.V.  Item  structure  as  a determinant  of  item  difficulty 
in verbal analogies. Educational & Psychological Measurement, 1973, 33,
143-149.

Achievement  Measures  18 

ERIC  Document  Reproduction  Service  ED  073  050  A-N 


Adams, C. R., & Smeltzer, C. H. The scientific construction of an
interviewing chart. Personnel, 1936, 1^, 14-19.


Adams,  E.  L.,  Jr.  Use  of  student  reaction  questionnaires  at  service  schools. 
Military  Review,  1950,  ^(11),  58-62. 

Questionnaire  Theory  and  Development  17 

Psych. Abst., 24, #4350 (Rev. from rept.)  R-H

Adams,  H.E.,  & Kirby,  A.  C.  Manifest  anxiety,  social  desirability,  or 
response  set.  Journal  of  Consulting  Psychology,  1963,  ^(1),  59-61. 

Response  Bias,  Question  Stem  12,  3g 

Psych. Abst., 37, #8006  A-H


Adams,  H.  F.  An  objectivity-subjectivity  ratio  for  scales  of  measurement. 
Journal  of  Social  Psychology,  1930,  1^,  122-135. 


Adams,  H.  F.  Validity,  reliability,  and  objectivity.  Psychological  Mono- 
graphs: General  and  Applied.  1936,  t£j_,  329-350. 




Adams,  J.  F.  An  evaluation  of  the  effect  of  level  of  item  difficulty  on 
various indices of item-discrimination. Dissertation Abstracts, 1959, 20,
1066-1067.


* Adams,  J.  S.  An  experiment  on  question  and  response  bias.  Public  Opinion 
Quarterly, 1956, 20, 593-598.

Response  Alternatives,  Response  Bias  3g,  12 


Adkins,  D.  C.  A comparative  study  of  methods  of  selecting  items.  Columbus, 
Ohio:  Ohio  State  University  Library,  1937. 

N/A 


Adkins,  D.  C.  Needed  research  on  examining  devices.  American  Psychologist, 
1948,  3,  104-106. 


Adkins,  D.  C.  A commentary  on  multiple-choice  item  criteria.  Public 
Personnel  Review,  1958,  j^,  296-298. 

Achievement  Measures 

Psych. Abst., 33, #11072  A-N

The  Advertising  Research  Foundation.  Sources  of  Published  Advertising 
Research . 1960. 


The  Advertising  Research  Foundation.  Journal  of  advertising  research 
cumulative  index.  (Index  to  all  articles  in  the  Journal  from  1960  on.) 


Ager,  J.,  Reece,  M. , & Saltz,  E.  Studies  of  forced-choice  methodology: 
Individual differences in social desirability. Educational and Psychological
Measurement, 1962, 22, 365-370.




* Agostini, J. M. The case for direct questions on reading habits. Journal
of Advertising Research, 1962, 4(2), 28-33.

Semantic  Differential  Items,  Validity  15 

ORA  R-M 


Ahlgren,  A.  Reliability,  predictive  validity,  and  personality  bias  of 
confidence-weighted  scores  (ERIC  Document  Reproduction  Service,  ED  033  384). 
Washington,  D.C.:  American  Educational  Research  Association,  1968. 

Achievement  Measures  18 

ERIC  Document  Reproduction  Service,  ED  033  384  A-N 


Aiken, E. G. Alternate forms of a semantic differential for measurement
of changes in self-description. Psychological Reports, 1965, 1^,
177-178.


Aiken,  L.  R.,  Jr.  Frequency  and  intensity  as  psychometric  response 
variables.  Psychological  Reports,  1962,  1J^(2) , 535-538. 

Investigator Error, Response Alternatives  3g, 8

Psych. Abst., 37, #8007  A-H


Aikenhead,  G.  S.  A new  methodology  for  test  construction  in  course  eval- 
uation (ERIC  Document  Reproduction  Service,  ED  080  312).  Paper  presented 
at  the  Annual  Meeting  of  the  National  Association  for  Research  in  Science 
Teaching,  Detroit,  Michigan,  1973. 

Achievement  Measures  18 

ERIC  Document  Reproduction  Service,  ED  080  312  A-N 


Air  University,  Educational  Services  Division,  Test  Construction  Unit. 

Test construction and interpretation. Bulletin No. 1. Maxwell Field, Ala.:
Air  University,  1946. 


N/A  N/A

N/A  T-M



Ajzen,  I.,  & Fishbein,  M.  The  prediction  of  behavior  from  attitudinal  and 
normative variables. Journal of Experimental Social Psychology, 1970, 6,
466-487. 


Alderfer,  C.  P.  Convergent  and  discriminant  validation  of  satisfaction  and 
desire measures by interviews and questionnaires. Journal of Applied
Psychology, 1967, 51, 509-520.


Aleamoni, L. M. MERMAC: A model and system for instructional test and
questionnaire analysis. Behavior Research Methods & Instrumentation, 1971,
3(4), 213-216.


* Alilunas,  L.  J.  The  personal  setting  as  an  influence  in  the  study  of  the 
attitudes  of  college  freshmen  toward  capitalism.  School  and  Society,  1949, 
284-286. 

Attitude  Measures,  Questionnaire  Theory  and  Development  12,  13 

Psych. Abst., 26, #1433 (Rev. from rept.)  R-M


Alker,  H.  A.,  et  al.  Multiple-choice  questions  and  student  characteristics. 
Journal  of  Educational  Psychology,  1969,  J^(3) , 231-243. 


Allen,  I.  L.  Detecting  respondents  who  fake  and  confuse  information  about 
question  areas  on  surveys.  Journal  of  Applied  Psychology,  1966,  50(6), 
523-528.


Allen,  I.  L.,  & Colfax,  J.  D.  Respondent's  attitudes  toward  legitimate 
surveys  in  four  cities.  Journal  of  Marketing  Research,  1968,  431-433. 


Allport, G. W., Vernon, P. E., & Lindzey, G. Study of values. Boston:
Houghton  Mifflin,  1960. 


Alpert, M. I. Identification of determinant attributes: A comparison of
methods. Journal of Marketing Research, 1971, 184.

Question  Stem,  Validity,  Bibliography,  Instrument  Format  3g,  15,  16 

ORA  R-H 




* Altemeyer, R. A. Adverbs and intervals: A study of Likert scales.
American Psychological Association, Proceedings of the Annual Convention of,
1970, 5(pt. 1), 397-398.

Adjectives,  Rating  Scales,  Response  Alternatives  2,  3d,  3f,  6 

R-H 


Alutto, J. A. Some dynamics of questionnaire completion and return among
professional and managerial personnel: The relative impacts of reception at
work site or place of residence. Journal of Applied Psychology, 1970, 54(5),
430-432.


* Ambler, R. K., Blair, J. T., deRivera, J., Nelson, P. D., & Schoenberger,
R. W. A note comparing the interview and written questionnaire techniques
for identifying anxiety toward flying. USN School of Aviation Medical Research
Report, Subtask 1, No. 20, Project No. NM 16 01 11, 1958.

Interviews,  Personality  Measures  1 

Psych.  Abst.  , #2216  A-H 


Anastasi,  A.  An  empirical  study  of  the  applicability  of  sequential  ana- 
lysis to  item  selection.  Educational  and  Psychological  Measurement.  1953, 
13,  3-13. 


Question  Stem  3g 

Psych.  Abst.  , # 109 

Anastasi,  T.  E.,  Jr.  Face  to  face  communication.  Cambridge,  Mass.: 
Management  Center,  1967. 


Anastasi,  A.  Psychological  testing.  (3d  ed.)  Macmillan,  1968. 


Anderberg, M. R. Cluster analysis for applications. New York: Academic
Press, 1972.


Anderson,  D.  H.,  & Petersen,  D.  F.  Closing  the  communications  gap  with 
item  sampling.  Paper  presented  at  the  Annual  Meeting  of  the  American 
Educational  Research  Association,  New  York,  New  York,  1971. 


* Anderson,  N.  H.  Likableness  ratings  of  555  personality-trait  words. 
Journal of Personality and Social Psychology, 1968, 9, 272-278.


Adjectives  6 

Journal  of  Personality  and  Social  Psychology,  9,  p.  272  R-NA 


Andrews, F. M., Morgan, J. N., & Sonquist, J. A. Multiple classification
analysis: A report on a computer program for multiple regression using
categorical predictors. Ann Arbor, Mich.: University of Michigan, Survey
Research Center, Institute for Social Research, 1969.

Data  Analysis  18 

ORA  R-N 


Andrews, R. S. A psychological technique for selection of personnel to
perform as observer-recorders on field tests. QM R&E Evaluation Agency,
Technical Report R-3, Project No. 07-98-05-001, FEA MRS 4801 (MRS 48-7f),
1958.

N/A  N/A

N/A  T-H

Angoff, W. H. An empirical approach to a problem of psychophysical
scaling. Journal of Applied Psychology, 1949, 33, 59-68.

Angoff, W. H. Test reliability and effective test length. Psychometrika,
1953,  18,  1-14. 


Angoff,  W.  H.  Basic  equations  in  scaling  and  equating.  Statistical 
Report  61-51.  Princeton,  New  Jersey:  Educational  Testing  Service,  1961. 


Annis, A. D., & Meier, N. C. The induction of opinion through suggestion
by means of "planted content." Journal of Social Psychology, 1934, 5,
65-81.

N/A  18

ORA  R-N



Anonymous.  Mogul  "semantic  differential"  aims  to  provide  qualitative 
research  data.  Advertising  Age,  1958,  ^(36),  3. 


* Appel, V. An experimental test of the superiority and theory of
forced-choice questionnaire construction. Dissertation Abstracts,
Sept. 1959, 20, 1067-1068.

Forced-Choice Items, True-False Items, Validity, Instrument Length  2, 5

Dissertation Abstracts, 20, p. 1067 (Rev.)  A-H


Arentsen, K. An investigation of the questionnaire method by means of
the Cornell Index (Form N2). I. Review of the literature and method;
Results for a group of military recruits. II. Results for a group of
military medical patients. Acta Psychiatrica et Neurologica, København,
1957, 32(3), 231-279.

Military  Personnel,  Personality  Measures  18 

Psych.  Abst.,  33,  # 1240  A-N 


* Armstrong, J. S., & Overton, T. Brief vs. comprehensive descriptions in
measuring intentions to purchase. Journal of Marketing Research, 1971,
8, 114-117.

Clarity  3g,  4 

ORA  R-H 


Arnold,  H.  L.  Analysis  of  discrepancies  between  true-false  and  simple 
recall  examination.  Journal  of  Educational  Psychology,  1927,  J^,  414-420. 


Aronson, A. H. Service rating plans. Public Personnel Review, 1941, 2,
298-305.


Arthol, R. P., & Bridge, C. Project Echo. Phase I. (AD-657 613). Santa
Barbara, California: General Research Corporation, June 1967. Report No.
GRC-CR-0040-1.

N/A  N/A 

N/A  T-M 




Asch,  M.  J.  Negative  response  bias  and  personality  adjustment.  Journal  of 
Counseling  Psychology,  1958,  206-210. 


Asch, S. E. Studies in the principles of judgments and attitudes: II.
Determination of judgments by group and by ego standards. Journal of
Social Psychology, 1940, 12, 433-465.


Asch, S. E., Block, H., & Hertzman, M. Studies in the principles of
judgments and attitudes: I. Two basic principles of judgment. Journal of
Psychology, 1938, 5, 219-251.


Aschal,  A.  P.  Relative  values  of  poll-end  and  open-end  questions  in 
search  for  reasons  of  a problem.  Educational  Psychology,  Delhi,  1958, 
55-60. 


Open-Ended Items, Response Bias, Close-Ended Items  2, 12

Psych. Abst., 33, #8117 (Rev.)


Ash,  P.,  & Abramson,  E.  The  effect  of  anonymity  on  attitude-questionnaire 
response.  Journal  of  Abnormal  and  Social  Psychology,  1952,  ^(3),  722-723. 


Anonymous Respondent, Rating Scales  7, 11

Psych. Abst., #3444


Ashburn,  R.  An  experiment  in  the  essay-type  question.  Journal  of  Experi- 
mental Education,  1938,  7^,  1-3. 


Ashburn, R. R., & Bradshaw, J. H. An experiment in the continuity-type
question. Journal of Educational Research, 1953, 47, 201-209.

Achievement Measures  18

Psych. Abst., 28, #6584  A-N


Asher, J. J. The Q by Q interview: With applications to personnel selection
and to decision-making by courtroom juries (AD-711 066). San Jose, California:
San Jose State College, 1967. Contract NONR-4817(00).




Asher, J. J. How the applicant's appearance affects the reliability and
validity of the interview. Educational and Psychological Measurement,
1970, 30(3), 687-695.


Athey, K. R., Coleman, J. E., Reitman, A. P., & Tang, J. Two experiments
showing the effect of the interviewer's racial background on responses to
questionnaires concerning racial issues. Journal of Applied Psychology,
1960, 44(4), 244-246.


Atkin,  C.  K.,  & Chaffee,  S.  H.  Instrumental  response  strategies  in  opinion 
interviews.  Public  Opinion  Quarterly,  1972,  36(1),  69-79. 

Interviews,  Response  Bias,  Investigator  Error  12 

ORA  R-H


Attneave,  F.  A method  of  graded  dichotomies  for  the  scaling  of  judgments. 
Psychological  Review,  1949,  334-340. 

Scaling, Paired Comparison Items, Ranking  14, 2

ORA  R-N


Auld,  F.,  Jr.  Influence  of  social  class  on  personality  test  responses. 
Psychological  Bulletin,  1952,  318-332. 


Ausubel,  D.  P.,  & Schpoont,  S.  H.  Prediction  of  group  opinion  as  a function 
of  extremeness  of  predictor  attitudes.  Journal  of  Social  Psychology,  1957, 
19-29. 


Axelrod, J. N. Attitude measures that predict purchase. Journal of
Advertising Research, 1968, 8(1), 3-17.

Rating Scales, Paired Comparison Items, Forced Choice Items, Card Sort, Open-ended Items, Ranking, Reliability, Validity  2

ORA  R-H


Ayad, J. M., & Farnsworth, P. R. Shifts in the values of opinion items;
further data. Journal of Psychology, 1953, 36, 295-298.


Azrin, N. H., Holz, W., & Goldiamond, I. Response bias in questionnaire
reports. Journal of Consulting Psychology, 1961, 25, 324-326.


* Babatz,  S.  The  effect  of  race  of  experimenter,  instruction,  and  compari- 
son population  upon  level  of  reported  anxiety  in  negro  subjects.  Journal 
of  Personality  and  Social  Psychology,  1967,  2(2),  194-196. 

Investigator Error, Respondent's Motivation  9, 11

Journal  of  Personality  and  Social  Psychology,  2> 


Bachrack, S. D., & Scoble, H. M. Mail questionnaire efficiency: Controlled
reduction  of  nonresponse.  Public  Opinion  Quarterly,  1967,  31,  265-271. 


Back, K. W., & Gergen, K. J. Idea orientation and ingratiation in the
interview: A dynamic model of response bias. Proceedings of the Social
Statistics Section, American Statistical Association, 1963, 284-288.


* Back,  K.  W.,  Hill,  R.,  & Stycos,  J.  M.  Interviewer  effect  on  scale  repro- 
ducibility. American  Sociological  Review,  1955,  20,  443-446. 

Interviews, Investigator Error, Response Bias  10, 12

ORA  R-M


* Baehr,  M.  E.  A simplified  procedure  for  the  measurement  of  employee 
attitudes.  Journal  of  Applied  Psychology,  1953,  37,  163-167. 

Attitude Measures, Instrument Format, Response Alternatives, Scoring, Rating Scales  2, 3a, 3c, 8

ORA


Baier,  D.  E.  Reply  to  Travers'  "A  critical  review  of  the  validity  and 
rationale  of  the  forced-choice  technique."  Psychological  Bulletin,  1951, 
421-434. 

Forced  Choice  Items,  Questionnaire  Theory  and  Development  14 


ORA  R-N


Bain, R. Theory and measurement of attitudes and opinions. Psychological
Bulletin, 1930, 27, 357-379.


Bain,  R.  Stability  in  questionnaire  response.  American  Journal  of 
Sociology, 1931, 37(3), 445-453.


Baker, E. L. The effects of manipulated item writing constraints on the
homogeneity of test items. Center for the Study of Evaluation Reprint
Series No. 11. California University, Los Angeles: Center for the Study of
Evaluation, 1970. Spons. Agency-Office of Education (DHEW), Washington, D.C.,
Bureau of Research (BR-6-1646).

Achievement  Measures  18 

ERIC  Document  Reproduction  Service,  ED  036  870  A-N 


Bakker, F. J. Einige Probleme und Methoden der Itemanalyse. (Some problems
and methods of item analysis.) Diagnostica, 1958, 4, 41-48.

Achievement Measures  18

Psych. Abst., 33, #9253  A-N

Baker,  H.  J.  The  construction  and  statistical  interpretation  of  psycholog- 
ical tests.  Review  of  Educational  Research,  1932,  2,  295-299. 


Balinsky, B., Blum, M. L., & Dutka, S. The coefficient of agreement in
determining product preferences. Journal of Applied Psychology, 1951, 35,
348-351.

Paired Comparison Items, Preference Measures, Reliability, Data Analysis  2, 8, 14

Psych. Abst., 26, #4227 (Rev. from rept.)  R-M


* Ballin, M., & Farnsworth, P. R. A graphic rating method for determining
the scale values of statements in measuring social attitudes. Journal of
Social Psychology, 1941, 13, 323-327.

Rating Scales, Scaling  14, 2

ORA  R-H


* Banta,  T.  J.  Social  attitudes  and  response  styles.  Educational  and 
Social  Measurement.  1961,  2^,  543-557. 

Response  Bias,  Rating  Scales  12,  2 

Psych. Abst., 36, #4GD43B  A-M


Banta, T. J. Critical note on unidimensional tests. Psychological
Reports, 1962, 1J,(2), 449-450.

Response Bias  14, 12

Psych. Abst., 37, #7975  A-N


Barclay, A., & Thumin, F. J. A modified semantic differential approach
to attitudinal assessment. Journal of Clinical Psychology, 1963, 19(3),
376-378.

Semantic  Differential  Items,  Personality  Measures  2 

Psych.  Abst. , 3£,  #7574  A-N 


* Barclay,  J.  E.,  & Weaver,  H.  B.  Comparative  reliabilities  and  ease  of 
construction  of  Thurstone  and  Likert  attitude  scales.  Journal  of  Social 
Psychology.  1962,  58(1),  109-120. 

Scaling,  Reliability  2,  14 

Psych. Abst., 37, #6564  A-M


* Barclay,  W.  D.  The  semantic  differential  as  an  index  of  brand  attitude. 
Journal  of  Advertising  Research.  1964,  4(1),  30-33. 

Semantic  Differential  Items,  Validity  2 

ORA  R-H 


Barker,  W.  S.,  & Gorham,  W.  A.  A research  on  the  acceptance  of  QM  cloth- 
ing and  equipment.  Psychological  Research  Associates , PRF  Report  55-3 
1958. 

N/A  N/A 

N/A  T-r 




* Barnes, T. I. The critical incident technique. Sociology and Social
Research, 1960, 44, 345-347.

Critical  Incident  Technique 


Barnett,  L.  D.  Responses  to  questionnaires  completed  inside  and  outside 
the  classroom.  The  Family  Life  Coordinator,  1965,  14(3),  130-132. 


Barnhart,  E.  N.  A computational  short  cut  in  determining  scale  values 
for ranked items. Psychometrika, 1939, 4, 241-242.


Barrett,  F.  D.,  Jr.  The  Canis  method  of  reducing  bias  in  survey  research. 
Journal of Marketing Research, 1972, 9, 329-330.

ORA  A-NA


* Barrett, R. S., Taylor, E. K., Parker, J. W., & Martens, L. Rating Scale
Content: I. Scale information and supervisory ratings. Personnel
Psychology, 1958, 11, 333-346.

Rating Scales, Response Alternatives, Investigator Error  3f, 12

Psych. Abst., 33, #11121  A-H


Barron,  F.  Some  personality  correlates  of  independence  of  judgment. 
Journal  of  Personality,  1952,  387-297. 


Barron,  A.  S.  The  effects  of  three  styles  of  interviewing  on  the  response 
of  women  from  two  contrasting  socioeconomic  groups.  Ph.D.  dissertation, 
Columbia  University,  1957. 


Barron,  B.  A.,  Gluckman,  M. , & Hirsch,  J.  The  construction  and  calibration 
of  behavioral  rating  scales.  Behavioral  Science,  1970,  J^(3),  220-226. 

Reliability,  Data  Analysis 




* Bartlett,  C.  J.  Factors  affecting  forced-choice  response.  Personnel 
Psychology, 1960, 399-406.

Forced  Choice  Items,  Investigator  Error  13 

Psych.  Abst.,  3^,  #3HE99B  A-M 


* Bartlett,  C.  J.,  & Doorley,  R.  Social  desirability  response  differences 
under  research,  simulated  selection,  and  faking  instructional  sets. 
Personnel  Psychology,  1967,  ^(3),  281-228. 

Investigator  Error,  Response  Bias,  Clarity  7,  11 

ORA 


* Bartlett,  C.  J.,  Heermann,  E.,  & Rettig,  S.  A comparison  of  six  different 
scaling  techniques.  Journal  of  Social  Psychology,  1960,  343-348. 

Paired Comparison Items, Ranking, Reliability, Rating Scales, Scaling  2

Psych. Abst., #7620  A


Bartlett, C. J., Quay, L. C., & Wrightsman, L. S., Jr. A comparison of
two methods of attitude measurement: Likert-type and forced choice.
Educational and Psychological Measurement, 1960, 699-704.

Forced-Choice  Items,  Response  Bias,  Rating  Scales  2,  12 

Psych.  Abst.,  35 , #3338  A-M 


* Bartlett, C. J., & Sharon, A. T. Effect of instructional conditions in
producing leniency on two types of rating scales. Personnel Psychology,
1969, 22(3), 251-263.

Investigator Error, Raters, Rating Scales, Forced Choice Items  7, 13, 2

Personnel Psychology, 22, p. 262 (Rev. from rept.)  R-M




Bauman, L. J., Rogers, T. F., & Weiss, C. H. Abstracts of Papers on
Respondent-Interviewer Interaction in the Research Interview. Columbia
University: Bureau of Applied Social Research, 1971.

Bibliography 


Baumgarten, F. A proverb test for attitude measurement. Personnel
Psychology, 1952, 249-261.

Open-ended Items, Attitude Measures  18

Psych. Abst., 27, #6798  A-N

Bayroff, A. G., & Burke, J. H. The rater's guide. Personnel Psychology,
1950, 3, 461-465.

Military Personnel  1^

Psych. Abst., 25, #3978  A-M


* Bayroff, A. G., Haggerty, H. R., & Rundquist, E. A. Validity of ratings
as related to rating techniques and conditions. Personnel Psychology, 1954,
7, 93-112.

Raters, Forced Choice Items, Rating Scales, Validity, Military Personnel  1, 2, 7, 9

Psych. Abst., 2^, #3086


* Becker, B. W., & Myers, J. G. Yeasaying response style. Journal of
Advertising Research, 1970, 10(6), 31-36.

Response Bias, Semantic Differential Items  12


Becker,  S.  L.  Why  an  order  effect?  Public  Opinion  Quarterly,  1954,  18, 
271-278. 

Check List, Instrument Format, Investigator Error, Response Alternatives, Instrument Length  13, 3a, 5,

ORA




Beckwith,  N.  E.,  & Lehmann,  D.  R.  The  importance  of  differential  weights 
in  multiple  attribute  models  of  consumer  attitude.  Journal  of  Marketing 
Research, 1973, 141-5.


Bejar,  I.  I.,  & Weiss,  D.  J.  Comparison  of  four  empirical  differential 
item  scoring  procedures.  Proceedings  of  the  81st  Annual  Convention  of  the 
American  Psychological  Association,  Montreal,  Canada,  1973,  31-32. 


Scoring 


Beldo, L. A. A multiple classification study of phrasing attitude
statements. Dissertation Abstracts, 1953, 13, 1246. (Abstract of Ph.D. thesis.)


* Belkin,  M. , & Lieberman,  S.  Effect  of  question  wording  on  response  dis- 
tribution. Journal  of  Marketing  Research,  1967,  4,  312-313. 

Question  Stem,  Interviews  3g 


Bell, R. Q., Hartup, W. W., & Crowell, D. H. Mailed versus supervised
administration of a projective questionnaire. Journal of Consulting
Psychology, 1962, 26(3), 290.


Bell, F. O., Hoff, A. L., & Hoyt, K. B. Answer sheets do make a
difference. Personnel Psychology, 1964, 17(1), 65-71.

Instrument Format, Military Personnel, Scoring  3g


Beloff, H. Two forms of social conformity: Acquiescence and
conventionality. Journal of Abnormal and Social Psychology, 1958, 56, 99-104.


Bellows,  R.  M.,  & Estep,  M.  F.  Employment  psychology:  The  interview.  New 
York:  Holt,  Rinehart  and  Winston,  1954. 




Belson,  W.  A.  A volunteer  bias  in  test-room  groups.  Public  Opinion 
Quarterly, 1969, ^(1), 115-126.

Respondent's Motivation, Response Bias, Investigator Error  11, 12, 9

ORA


* Belson, W. A. Interviewer deviation from instructions in telling
respondents how to use semantic differential scales. London: Reprint Series,
Survey Research Centre, London School of Economics and Political Science,
n.d. (a).

Semantic  Differential  Items,  Investigator  Error  7 

ORA  R-H 


* Belson, W. A. Research into question design. London: Reprint Series,
Survey Research Unit, London School of Economics and Political Science,
n.d. (b).

Investigator Error, Questionnaire Theory and Development, Literature Review  16, 3g

ORA  R-M


* Belson,  W.  A.  Respondent  understanding  of  survey  questions.  London: 
Reprint  Series,  Survey  Research  Centre,  London  School  of  Economics  and 
Political Science, n.d. (c).

Clarity,  Interviews,  Investigator  Error  4,  13 

ORA  R-H 


Belson, W. A. Studies in readership. London: Business Publications Ltd.,
1962.

N/A  N/A

N/A  T-M

Belson, W. A., & Bell, C. R. A bibliography of papers bearing on the
adequacy of techniques used in survey research. London: Oakwood, 1960 (a).




Belson, W., & Bell, C. R. A bibliography: Techniques used in survey
research. The Market Research Society, 1960 (b).


* Belson, W. A., & Duncan, J. A. A comparison of the check-list and
open-response questionnaire systems. Applied Statistics, 1962, 11(3), 120-132.

Check  List,  Open-Ended  Items,  Response  Alternatives  2 

ORA  R-H 


Belson,  W.  A.,  & Thompson,  B.  Bibliography  of  methods  of  social  and 
business  research.  London:  London  School  of  Economics  and  Political 
Science  and  Crosby  Lockwood,  1973. 


Belson, W. A., & Yule, V. R. The semantic differential scaling system in
market  research.  II.  Accuracy  of  ratings.  London:  Reprint  Series, 

Survey  Research  Centre,  London  School  of  Economics  and  Political  Science. 

Clarity,  Semantic  Differential  Items,  Reliability,  4,  7,  3f 

Investigator  Error,  Response  Alternatives 

ORA  R-H 


* Bendig,  A.  W.  The  reliability  of  self-ratings  as  a function  of  the  amount 
of  verbal  anchoring  and  the  number  of  categories  on  the  scale.  Journal  of 
Applied Psychology, 1953, 37, 38-41.

Instrument  Format,  Rating  Scales,  Response  Alternatives,  3a,  3f,  8 

Reliability 

Journal  of  Applied  Psychology,  37,  pp.  40-41.  R-H 


* Bendig,  A.  W.  Reliability  and  the  number  of  rating  scale  categories. 
Journal of Applied Psychology, 1954(a), 38, 38-40.

Preference  Measures,  Reliability,  Rating  Scales,  3a,  8,  13 

Response  Alternatives 

Psych. Abst., 29, #81 (Rev. from rept.)




* Bendig, A. W. Transmitted information and the length of rating scales.
Journal of Experimental Psychology, 1954(c), 47, 303-308.

Response  Alternatives,  Rating  Scales  3a,  8,  3g 

Journal of Experimental Psychology, 47, p. 307 R-M


Bendig,  A.  W.  Rater  reliability  and  the  heterogeneity  of  the  scale  anchors. 
Journal of Applied Psychology, 1955, 39, 37-39.

Investigator  Error,  Preference  Measures,  Raters,  Response  3f,  12,  13,  3g 
Bias,  Response  Alternatives,  Question  Stem 

Psych. Abst., 30, #68 (Rev.)


Bendig,  A.  W.  "Social  desirability"  and  "anxiety"  variables  in  the  IPAT 

Anxiety  Scale.  Journal  of  Consulting  Psychology,  1959,  23,  377. 


* Bendig,  A.  W.,  & Hughes,  J.  B.  Effect  of  amount  of  verbal  anchoring  and 
number  of  rating-scale  categories  upon  transmitted  information.  Journal  of 
Experimental  Psychology,  1953,  87-90. 

Instrument  Format,  Rating  Scales,  Response  Alternatives  3a,  3f,  8 

Psych. Abst., 28, #5944 (Rev. from rept.) R-H


Benjamin,  K.  Combining  responses  on  two  forms  of  a questionnaire  with 
options in inverse order. Public Opinion Quarterly, 1949-50, 13, 688-690.


Bennett,  A.  S.  Some  aspects  of  preparing  questionnaires.  Journal  of 
Marketing, 1945, 10, 175-179.


Bennett,  A.  S.  Observations  on  the  so-called  cheater  problem  among  field 
interviewers.  International  Journal  of  Opinion  and  Attitude  Research,  1948, 
2(1),  70-84. 




Bennett, E. M., Alpert, R., & Goldstein, A. C. Communications through
limited-response questioning. Public Opinion Quarterly, 1954, 18, 303-308.

Close-Ended  Items,  Interviews,  Attitude  Measures, 

Investigator  Error  1 

ORA  R-H 


Bennett, E. M., Blomquist, R. L., & Goldstein, A. C. Response stability in
limited-response  questioning.  Public  Opinion  Quarterly,  1954,  218-233. 


Bennett,  J.  F.,  & Hays,  W.  L.  Multidimensional  unfolding:  Determining 

the dimensionality of ranked preference data. Psychometrika, 1960, 25,
27-43. 


Bennett,  P.  D.  (Ed.)  Research  methodology:  Longitudinal  analysis.  Paper 

presented  at  the  Proceedings  of  Fall  Conference,  American  Marketing  Associa- 
tion, Marketing  & Economic  Development,  1965,  205-275. 


Benney, M., Riesman, D., & Star, S. A. Age and sex in the interview.
American Journal of Sociology, 1956, 143-152.


Benson,  A.  H.  Paired  comparison  approach  to  evaluating  interview. 
Journal  of  Marketing  Research,  1969,  6,  66. 


Benson, L. E. Studies in secret-ballot technique. Public Opinion
Quarterly, 1941, 5, 79-82.


Benson,  P.  H.  A paired  comparison  approach  to  evaluating  interviewer 
performance.  Journal  of  Marketing  Research,  1969,  6(1),  66-70. 


Benson,  P.  H.  How  many  scales  and  how  many  categories  shall  we  use  in 
consumer  research?  - A comment.  Journal  of  Marketing,  1971,  22(4),  59-61. 


Ranking,  Rating  Scales,  Response  Alternatives 


2,  3a,  3f 


Benson,  R.  M.  Effects  of  instructions  and  verbal  modeling  on  health 
information  reporting  in  household  interviews.  Ann  Arbor,  Mich.:  Univer- 
sity of  Michigan,  Survey  Research  Center,  PHS  HS00624-02,  1973. 


Bent,  R.  K.  Various  techniques  of  combining  ratings.  Journal  of  Educa- 
tional Psychology,  1937,  2^,  65-70. 

Data  Analysis  15 

ORA R-N


Bentler, P. M. Semantic space is (approximately) bipolar. Journal of
Psychology, 1969, 71, 33-40.

Adjectives,  Semantic  Differential  Items  14 

Journal of Psychology, 71, pp. 39-40 (Rev. from rept.) R-H


Bentler,  P.  M.,  & Jackson,  D.  N.  Identification  of  content  and  style: 

A two-dimensional  interpretation  of  acquiescence.  Psychological  Bulletin. 
1971,  76(3),  186-204. 


N/A 

N/A 

N/A 

T-M 

Bentler, P. M., & LaVoie, A. L. A nonverbal semantic differential.
Journal  of  Verbal  Learning  and  Verbal  Behavior,  1972,  11(4),  491-496. 

Clarity 

4 

ORA 

R-N 

Benton, A. L. Influence of incentives upon intelligence test scores of
school children. Journal of Genetic Psychology, 1936, 49, 494-496.

Respondent's  Motivation 

11 

ORA 

R-N 

Benton,  A.  L.,  & Kornhauser,  G.  I.  A study  of  "score  faking"  on  a mechan- 
ical interest  test.  Journal  of  the  Association  of  American  Medical  Colleges, 
1948,  23,  57-60. 


Berlioz, L. Liaisons d'items. (Relationship of items.) Bulletin d'Etudes
et Recherches Psychologique, 1962, 11(2), 117-136.


Data  Analysis  ° 

Psych. Abst., 38, #6042 A-N

Bernard,  J.  An  experimental  comparison  of  ranking  and  paired  comparisons 
as  methods  of  evaluating  questionnaire  items.  Papers  of  the  American  Socio- 
logical Society,  1933,  2^,  81-84. 

Ranking,  Paired  Comparison  Items  2 


Bernberg, R. E. The direction of perception technique of attitude
measurement. International Journal of Opinion and Attitude Research, 1951,
5, 397-406.


Questionnaire  Theory  and  Development  18 

Psych.  Abst.,  27,  #336  A-N 


Bernhardson,  C.  S.,  & Fisher,  R.  J.  The  relationship  between  personal 
desirability and endorsement with a forced-choice technique. Psychological
Abstracts, 1971, #9073.

Personality  Measures,  Response  Bias  10,  12 

Psych. Abst., 46, #9073 A-H


Bernstein,  L.  A.  Statistics  for  decisions.  New  York:  Grosset  & Dunlap, 

1965. 


Berrien, F. K. The sensitivity of employee attitude questionnaires (AD 226
671). New Brunswick, New York: Rutgers State University, 1959, Report No.
TR-5.


Best, W. H. Some new directions in personnel appraisal. Personnel, 1957,
45-50. 

Rating  Scales  2 

Psych. Abst., 23, #2229 A-N


Betts, G. L. Test calibration for categorical classification. Educational
and  Psychological  Measurement,  1949,  9,  269-279. 

Scaling  13 

Psych. Abst., 26, #2739 A-N


* Bevan, W., & Avant, L. I. Response latency, response uncertainty,
information transmitted and the number of available judgmental categories.
Journal of Experimental Psychology, 1968, 76, 394-397.

Response  Alternatives,  Respondent's  Motivation,  3a,  5 

Instrument Length

Journal of Experimental Psychology, 76, pp. 394-397


Bevis,  J.  C.  Interviewing  with  tape  recorders.  Public  Opinion  Quarterly. 
1949-50,  13(4),  629-634. 


Bhatt,  I.  D.  Performance  as  a function  of  varying  types  of  incentives 
and stresses. Education and Psychology Review, 1965, 5, 167-173.


Bickman,  L.,  & Henchy,  T.  (Eds.)  Beyond  the  laboratory;  Field  research  in 
social  psychology.  New  York:  McGraw-Hill,  1972. 


Biddle,  B.  J.  An  application  of  social  expectation  theory  to  the  initial 
interview.  Dissertation  Abstracts,  1958,  l^j  186. 


Bingham,  W.  E.,  Jr.  A study  of  the  relations  which  the  galvanic  skin  re- 
sponse and  sensory  reference  bear  to  judgments  of  the  meaningfulness,  signi- 
ficance, and  importance  of  72  words.  Journal  of  Psychology,  1943,  26  > 21-34. 




Bingham,  W.  E.,  Jr.  A study  of  the  effect  of  the  presence  of  the  examiner 
upon  test  scores  in  individual  testing.  Journal  of  Applied  Psychology, 
1944,  28,  471-476. 


* Bingham, W. V. Halo, invalid and valid. Journal of Applied Psychology,
1939,  23,  221-228. 

Investigator  Error,  Raters  7,  12 

Journal of Applied Psychology, 23, p. 221 (Rev.) R-H


* Bittner, R. H., & Rundquist, E. A. The rank-comparison rating method.
Journal  of  Applied  Psychology,  1950,  2^,  171-177. 

Ranking,  Paired  Comparison  Items,  Rating  Scales  2,  14 

Psych.  Abst.  , #3979  A-M 


Blake,  R.  H.  A comparison  of  the  test-retest  reliability  of  picture  and 
verbal  forms  of  occupational  interest  inventories.  Dissertation  Abstracts, 
1967,  ^(9-A),  2868. 


* Blake,  R.  Comparative  reliability  of  picture  form  and  verbal  form  interest 
inventories. Journal of Applied Psychology, 1969, 53(1), 42-44.

Interest Measures, Clarity, Reliability 8, 4

Journal  of  Applied  Psychology,  53,  42  (Rev.  from  rept.)  R-H 


Blake,  R.,  & Dennis,  W.  The  development  of  stereotypes  concerning  the 
Negro.  Journal  of  Abnormal  and  Social  Psychology,  1943,  38,  525-531. 


Blalock, H. M., Jr., & Blalock, A. G. (Eds.) Methodology in social research.
New York: McGraw-Hill, 1968.


* Blankenship,  A.  B.  Does  the  question  form  influence  public  opinion  poll 
results? Journal of Applied Psychology, 1940(a), 24, 27-30.

Clarity,  Investigator  Error,  Question  Stem  4,  12,  3g 

ORA  R-H 




Blankenship,  A.  B.  The  effect  of  the  interviewer  upon  the  responses  in  a 
public opinion poll. Journal of Consulting Psychology, 1940(b), 4(4),
134-136. 


* Blankenship,  A.  B.  The  influence  of  the  question  form  upon  the  response  in 
a public opinion poll. Psychological Records, 1940(c), 3, 345-422.


Attitude  Measures,  Clarity,  Question  Stem,  Validity 
ORA 


4,  3g 
R-H 


Blankenship,  A.  B.  Pre-testing  a questionnaire  for  a public  opinion  poll. 
Sociometry, 1940(d), 3, 263-269.


Blankenship,  A.  B.  The  "sample"  study  in  opinion  research.  Sociometry, 
1940(e),  3,  271-275. 


* Blankenship,  A.  B.  Psychological  difficulties  in  measuring  consumer  pre- 
ference. Journal  of  Marketing,  1942,  6,  66-75. 

Questionnaire  Theory  and  Development  14,  17 


Blankenship,  A.  B.  Consumer  and  opinion  research;  the  questionnaire  tech- 
nique. New  York:  Harper,  1943. 


Blankenship,  A.  B.  The  choice  of  words  in  poll  questions.  Sociology  and 
Social  Research,  1949(a),  12-18. 


Blankenship,  A.  B.  Source  of  interviewer  bias.  International  Journal  of 
Opinion and Attitude Research, 1949(b), 3(1), 95-98.


Blankenship, A. B. Let's bury paired comparisons. Journal of Advertising
Research , 1966,  ^(1),  13-17. 

Paired  Comparison  Items,  Validity,  Questionnaire  Theory  2,  14 

and  Development 

ORA R-N




Blanz, F., & Ghiselli, E. E. The mixed standard scale: A new rating system.
Personnel  Psychology,  1972,  2^,  184-189. 


Blankenship,  A.  B. , et  al.  Questionnaire  preparation  and  interviewer 
technique.  Journal  of  Marketing,  1949,  1^,  399-433. 


Blass, T., Pope, B., & Siegman, A. W. Verbal indices of interpersonal
imbalance in the interview. Proceedings of the Annual Convention of the
American Psychological Association, 1970, 5(Pt. 2), 525-526.


Bloch, E. L., Goodstein, L. D., Jourard, S. M., & Jaffe, P. E. Comment on
"influence  of  an  interviewer's  disclosure  on  the  self-disclosing  behavior 
of  interviewees."  Journal  of  Consulting  Psychology,  1971,  1^(6),  595-600. 


* Block,  J.  A comparison  of  the  forced  and  unforced  Q-sorting  procedures. 
Educational  and  Psychological  Measurement,  1956,  1^,  481-493. 

Card  Sorts  2 

Psych.  Abst. , 22, j #31  A-H 


* Block,  J.  A comparison  between  ipsative  and  normative  ratings  of  personality. 
Journal  of  Abnormal  and  Social  Psychology,  1957,  54,  50-54. 

Questionnaire  Theory  and  Development  14 

Psych.  Abst. , 33 , #806  A-H 


* Block,  J.  An  unprofitable  application  of  the  semantic  differential. 
Journal  of  Consulting  Psychology,  1958,  22,  235-236. 


Check  List,  Semantic  Differential 


2 


Psych.  Abst.  , 21  > #“^780 


A-M 


Block, J. The challenge of response sets. New York:
Appleton-Century-Crofts, 1965.

Response  Bias  12 

The  challenge  of  response  sets,  (rev.)  R-N 


Block,  J.  On  further  conjectures  regarding  acquiescence.  Psychological 
Bulletin,  1971,  76(3),  205-210. 


* Bloxom, B. Effects of anger-arousing instructions on personality
questionnaire performance. Educational and Psychological Measurement, 1968,
28(3), 735-745.

Personality  Measures,  Investigator  Error  13 


* Blumberg,  H.  H.,  De  Soto,  C.  B. , & Kuethe,  J.  L.  Evaluation  of  rating  scale 
formats.  Personnel  Psychology.  1966,  1^(3),  243-259. 

Instrument  Format,  Raters,  Rating  Scales,  Response  2,  3b,  3c,  3c, 

Alternatives 

Psych. Abst., 41, #00060 (Rev. from rept.)


Bogardus, E. S. Measuring social distance. Journal of Applied Sociology,
1925, 9, 299-308.


Bocher,  W.  Specific  motivational  and  situational  influences  on  the  results 
of questionnaire studies. In Merz, F. (Ed.), Bericht uber den 25. Kongress
der deutschen Gesellschaft fur Psychologie Munster, 1966, pp. 519-525.


Bock,  D.  G.  The  impact  of  rating  errors  on  the  use  of  rating  scales  in 
selected  experiments  in  oral  communication  research.  Dissertation  Abstracts 
International , 1970,  ^(10-A),  4584. 


Bock, R., Dicken, C., & Van Pelt, J. Methodological implications of
content-acquiescence correlations in the MMPI. Psychological Bulletin,
1969,  71,  127-39. 


Bock,  D.,  & Wood,  R.  Test  theory.  Annual  Review  of  Psychology,  1971,  22, 
193-224. 

Achievement  Measures,  Literature  Review  16 




Bodi,  M.  J.  Statement  scaling  study  for  14  rating  characteristics  of  the 
Commander's  evaluation  report.  Indianapolis,  Indiana:  U.S.  Army  Enlisted 

Evaluation  Center,  1967.  Technical  Report  No.  89. 

Military  Personnel,  Rating  Scales  14 

Psych.  Abst . , 43 , #7432  (Rev.)  A-M 


Boek, W. E., & Lade, J. H. A test of the usefulness of the postcard
technique in a mail questionnaire study. Public Opinion Quarterly, 1963, 27,
303-306.


Bogart,  L.  No  opinion,  don't  know,  and  maybe  no  answer.  Public  Opinion 
Quarterly, 1967, 332.


Bohrnstedt, G. W. A quick method of determining the reliability and validity
of multi-item scales. American Sociological Review, 1969, 542-548.


Boisen, M. Special nonresponse tabulations of three items on long form
schedules  in  mail  return  and  nonmail  return  sample.  Washington;  Cleveland 
Special  Census,  Bureau  of  the  Census,  Cleveland  Special  Census  Results 
Memorandum  No.  52,  (67-211  (MRD)  ),  1967. 


Boldt,  R.  F.  An  approximately  reproducing  scoring  scheme  that  aligns  ran- 
dom response  and  omission  (ERIC  Document  Reproduction  Service,  ED  057  074). 
Princeton,  New  Jersey:  Educational  Testing  Service,  1971. 

Achievement  Measures  18 

ERIC  Document  Reproduction  Service,  ED  057  074  A-N 


Booker,  H.  S.,  & David,  S.  T.  Differences  in  results  obtained  by  exper- 
ienced and  inexperienced  interviewers.  London:  London  School  of  Economics 
and  Political  Science. 


Borg, W. R., & Gall, M. D. Educational research: An introduction. New
York: David McKay, 1971.


Borgatta, E. F. (Ed.) Sociological methodology. San Francisco:
Jossey-Bass, 1969.

Borgatta,  E.  F.,  & Glass,  D.  C.  Personality  concomitant  of  extreme 
response  set.  Journal  of  Social  Psychology.  1961,  213-221. 

Personality  Measures,  Response  Bias  18 

Journal  of  Social  Psychology,  55 , p.  220  R-NA 


Bornstein, H., Jensen, B. T., Goldstein, L. G., Dunn, T. F., & Berkhouse,
R. G. Evaluation of the basic military performance test. USA TAGO
Personnel  Research  Branch,  Technical  Research  Note  No.  75,  1957. 

Achievement  Measures  18 

Psych. Abst., 33, #2252 A-N


Borislow, B. The Edwards Personal Preference Schedule (EPPS) and fakability.
Journal  of  Applied  Psychology,  1958,  22-27. 


Borus,  M.  E.  Response  error  and  questioning  technique  in  surveys  of  earning 
information. Journal of the American Statistical Association, 1970,
566-575.


Bose,  P.  K.  Some  criteria  in  item  selection  techniques.  Indian  Journal 
of  Psychology,  1958,  101-107. 


Achievement  Measures  18 

Psych. Abst., 35, #3948 A-N


Bottrill, J. An investigation of some intrinsic variables affecting test
responses. Journal of Psychology, 1969, 71, 83-88.




Boulger, J. G. A comparison of two methods of obtaining factual and
subjective life history data in follow-up studies: Structured interview vs.

questionnaire.  Dissertation  Abstracts  International,  1969,  ^(2-B),  1882- 
1833. 

Interviews,  Validity,  Close-ended  items 
ORA 


1,  10 
A-M 


* Boulger, J. G. Comparison of two methods of obtaining life history data:
Structured interview versus questionnaire. Proceedings of the Annual
Convention of the American Psychological Association, 1970, 5(Pt. 2), 557-558.

Interviews  1 

Psych. Abst., 44, #18811 A-M


Bowen,  J.  H.  Familiarity  scale  values  for  420  nouns  in  twelve  combinations 
of  frequency  of  occurrence  and  conceptual  categorization.  Psychological 
Reports , 1969,  25(3),  899-910. 

Questionnaire  Theory  and  Development  14 

ORA R-M


Bowers, D., & Fine, B. Core questionnaire format and positive response bias.
Ann  Arbor,  Mich.:  University  of  Michigan,  Institute  for  Social  Research, 

1968.  Research  Bulletin  No.  5. 

N/A  N/A 

N/A  T-H 


Boyd,  H.  W.,  Jr.,  & Westfall,  R.  Interviewers  as  a source  of  error  in 
surveys.  Journal  of  Marketing,  1955,  19(4)  , 311-324. 


* Boyd, H. W., Jr., & Westfall, R. Interviewer bias once more revisited.
Journal  of  Marketing  Research.  1965(a),  ]_,  249-253. 


9,  12,  6 


Investigator  Error,  Interviews,  Literature  Review 
Bureau  of  Census,  #7111023901 


R-H 


Boyd,  H.  W.,  Jr.,  & Westfall,  R.  Interviewer  bias  revisited.  Journal  of 
Marketing  Research,  1965(b),  7^(1),  58-63. 


Boyd, J. L., Jr., & Shimberg, B. Handbook of performance testing: A
practical guide for test makers. Princeton, N. J.: Educational Testing
Service, 1971.

Achievement  Measures  18 

ERIC  Document  Reproduction  Service,  ED  052  220  A-M 


Boyd, R. M. Influence of test form on assessment. Australian Journal of
Education, 1971, 15(2), 161-170.

* Bradburn,  N.  M.  Selecting  the  questions  to  be  asked  in  surveys.  Monthly 
Labor  Review,  1970,  27-29. 

Question  Stem  3g 

ORA  8 'll 


* Bradburn, N. M., & Mason, W. M. The effect of question order on responses.
Journal  of  Marketing  Research,  1964,  57-61. 

Instrument  Format,  Instrument  Length,  Literature  Review,  3c 

Open-ended  Items,  Interviews 

ORA  R'll 


Bradley, U. I., & McClelland, J. N. Basic statistical concepts. Palo Alto,
Calif.: Scott, Foresman, 1963.


Brauer, R. L. Preferences of occupants in military family housing (DAOF
8131).  Champaign,  Illinois,  OCE  Construction  Engineering  Research  Labora- 
tory, 1973. 

Questionnaire  Theory  and  Development  18 

DDC,  #IV.OF8131  T-N 




Braun, J. R. Forced-choice self-report devices: A look at some unwarranted

claims.  Measurement  and  Evaluation  in  Guidance,  1969,  2(3),  153-155. 

Forced-Choice  Items,  Investigator  Error  2,  13 


ORA 


R-H 


Brennan,  R.  D.  Trading  stamps  as  an  incentive  in  mail  surveys.  Journal  of 
Marketing, 1958, ^(1), 306-307.

Respondent's Motivation 11


Brenner, M. H. Test difficulty, reliability, and discrimination as
functions  of  item  difficulty  order.  Journal  of  Applied  Psychology.  1964, 
^(2)  , 98-100. 

Instrument Format, Achievement Measures, Reliability 3c

Psych.  Abst.,  39,  #5039  A-N 


Bressler,  J.  Judgment  in  absolute  units  as  a psychophysical  method. 
Archives  of  Psychology,  New  York,  1933,  No.  152. 


Bricker,  P.  D.  The  identification  of  redundant  stimulus  patterns.  Journal 
of  Experimental  Psychology,  1955,  49,  73-81. 


Brigham,  C.  A.  A study  of  error.  New  York:  College  Entrance  Examination 

Board,  1932. 


Brigham,  J.  C.,  & Cook,  S.  W.  The  influence  of  attitude  on  judgments  of 
plausibility:  A replication  and  extension.  Educational  and  Psychological 

Measurement , 1970,  ^(2),  283-292. 


* Brinkmeier, I. H. Minor studies on objective examination methods. IV.
Sentence-length as a specific determiner in true-false examinations.
Journal  of  Educational  Research,  1930,  203. 

Investigator  Error,  True-False  Items,  Achievement  3g,  14 

Measures,  Question  Stem 

ORA 


Brinkmeier,  I.  H.,  & Keys,  N.  Circumstantiality  as  a factor  in  guessing 
on true-false examinations. Journal of Educational Psychology, 1930, 21,
681-694. 


Brinkmeier,  I.  H.,  and  Ruch,  G.  M.  Minor  studies  on  objective  examination 
methods.  III.  Specific  determiners  in  true-false  statements.  Journal  of 
Educational  Research,  1930,  22,  110-118. 


Brinton,  J.  E.  Deriving  an  attitude  scale  from  semantic  differential  data. 
Public  Opinion  Quarterly,  1961,  ^(2),  289-295. 


Britt,  S.  H.  Why  it's  best  to  use  combination  research.  Printers'  Ink, 
1954, 249(4), 60-66.


Broadbent,  D.  E.  Word-frequency  effect  and  response  bias.  Psychological 
Review , 1967  , 2fi(l),  1-15. 

Adjectives,  Response  Bias  6,  12 

ORA 


* Brod, D., Kernoff, P., & Terwilliger, R. F. Anxiety and semantic differential
responses.  Journal  of  Abnormal  and  Social  Psychology,  1964,  ^(5),  570-574. 


Semantic  Differential  Items,  Response  Bias,  Personality  12 

Measures 

Psych.  Abst. , 22.>  il^4647  A-M 




Broedling,  L.  Development  of  attitudinal  assessment  techniques  (DN242001). 
Washington,  D.  C.:  Naval  Personnel  R & D Laboratory,  1973  (a). 

N/A  N/A 

N/A R-NA


Broedling,  L.  Development  of  a small-sample  methodology  applicable  to 
surveys of Navy personnel (DN242017). Washington, D. C.: Naval Personnel
R & D Laboratory, 1973 (b).


N/A 

N/A 

N/A 

R-NA 

Broen, W. E., Jr., & Wirt, R. D. Varieties of response sets. Journal of
Consulting  Psychology,  1958,  ^(3),  237-240. 


Brogden, H. E. Variation in test validity with variation in the
distribution of item difficulties, number of items, and degree of their
intercorrelation. Psychometrika, 1946, 11, 197-214.


* Brokaw, L. D. Comparative validities of "short" versus "long" tests.
Journal  of  Applied  Psychology,  1951,  325-330. 


Validity,  Reliability,  Instrument  Length,  Military  5 

Personnel 

Psych.  Abst. , 26,  #4197  A-M 


Brotemarkle, R. A., & Fernberger, S. W. A method for investigating the
validity of the categories of a judgment test. Journal of Educational
Psychology, 1934, 579-584.

Response  Alternatives,  Data  Analysis,  Psychophysical  3a,  8,  14 

Measures,  Validity,  Scaling 

ORA  R-M 




Brown,  E.  M.  Influence  of  training,  method,  and  relationship  of  the 
halo  effect.  Journal  of  Applied  Psychology.  1968,  ^(3),  195-199. 


Brown,  D.  T.  Stimulus  similarity  and  the  anchoring  of  subjective  scales. 
American  Journal  of  Psychology.  1953,  6^,  199-214. 

Response  Alternatives,  Psychophysical  Measures  3f 


Brown,  J.  M.  Respondents  rate  public  opinion  interviewers.  Journal  of 
Applied  Psychology,  1955,  39(2),  96-102. 


Brown,  M.  L.  Use  of  a postcard  query  in  mail  surveys.  Public  Opinion 
Quarterly, 1965, 635-637.


Brown,  R.  V.  Evaluation  of  total  survey  error.  Journal  of  Marketing 
Research,  1967,  4(2),  117-127. 


Brown,  R.  W.  Is  a boulder  sweet  or  sour?  Contemporary  Psychology,  1958, 
3,  113-115. 

Semantic  Differential  Items,  Questionnaire  Theory  and  14 

Development 


Brown,  S.  R.  Bibliography  on  Q technique  and  its  methodology.  Perceptual 
and  Motor  Skills,  1968,  26,  587-613. 


* Brown,  S.  R.  The  forced-free  distinction  in  Q technique.  Journal  of  Educa- 
tional Measurement,  1971,  ^(4),  283-287. 


Forced-Choice Items, Card Sorts

Journal  of  Educational  Measurement,  p.  286 


2,  18 


Brown, W. P. Individual differences in associating to neutral and
emotional words. Journal of Consulting and Clinical Psychology, 1970,
34, 33-36.

Adjectives, Personality Measures

Journal of Consulting and Clinical Psychology, p. 33


Browning, R. C., et al. A study of officer rating methodology; .Ll. Effect
of selected rater characteristics on validity of ratings. Report No. 909,
1952.


N/A 


N/A 


N/A 


T-M 


Brownless, V. T., & Keats, J. A. A retest method of studying partial
knowledge and other factors influencing item response. Psychometrika,
1958, 23, 67-73.


Brueckner, L. J., & Hawkinson, M. J. The optimum order of arrangement of
items in a diagnostic test. Elementary School Journal, 1934, 34, 351-357.


Brumback, G. B. A note on criterion contamination in the validation of
biographical data. Educational and Psychological Measurement, 1969,
(2), 439-443.


Brunner, G. A., & Carroll, S. J. The effect of prior notification on the
refusal rate in fixed address surveys. Journal of Advertising Research,
1969, 9(1), 42-44.


Respondent's  Motivation,  Interviews 
ORA 


9, 11
R-M 


Brunswick, E. Systematic and representative design of psychological
experiments. Berkeley & Los Angeles, Calif.: University of California Press,
1947.




Bruvold,  W.  H.  Category  and  successive  intervals  scales  for  rating 
statements  and  stimulus  objects.  Journal  of  Experimental  Psychology,  1969 
^(2),  230-234. 


Questionnaire  Theory  and  Development,  Scaling 


14 


ORA 


R-N 


* Bruvold,  W.  H.  Rater's  attitudes  and  the  method  of  equal-appearing  inter- 
vals. Proceedings  of  the  Annual  Convention  of  the  American  Psychological 
Association,  1971,  6(Pt.  1),  373-374. 

Attitude  Measures,  Rating  Scales,  Response  Bias  10,  12 

ORA  R-N 


Bruvold,  W.  H.  Consistency  among  attitudes,  beliefs  and  behavior.  Journal 
of  Social  Psychology,  1972,  127-134. 


Buchanan,  P.  C.,  & Wiley,  L.  N.  The  use  of  a multi-purpose  rank-rating 
scale  for  personnel  evaluation  and  preference  research.  American  Psycholo- 
gist, 1948,  3,  345. 

Rating  Scales,  Ranking  2 

American  Psychologist,  3,  p.  345  A-M 


Buchwald,  A.  M.  Variations  in  the  apparent  effects  of  "right"  and  "wrong" 
on  subsequent  behavior.  Journal  of  Verbal  Learning  and  Verbal  Behavior. 
1962,  1(1),  71-78. 

Response  Alternatives,  Investigator  Error,  3a,  11 

Respondent's  Motivation 

Psych.  Abst. , 37_,  #6128  A-M 


* Bucklow,  M.  Staff  reporting:  A new  look  at  the  halo  effect.  Personnel 
Practice  Bulletin,  1960,  1^(4),  29-33. 


Investigator  Error  13 

Psych. Abst., 35, #4045 (Rev.) A-N




Budd,  W.  C.,  & Blakely,  L.  S.  Response  bias  in  the  Minnesota  Teacher 
Attitude Inventory. Journal of Educational Research, 1958, 707-709.


Response  Bias 

Psych.  Abst.,  #11043 


12 

A-N 


Bueker, K. Reactions to a questionnaire survey: Some sources of respondent
resistance. Journal of Psychiatric Nursing & Mental Health Services, 1969,
7(5), 215-221.


Buel,  W.  D.  A simplification  of  Hay's  method  of  recording  paired  compari- 
sons. Journal  of  Applied  Psychology.  1960,  347-348. 


* Buel,  W.  D.  Preference  indices  stability  in  forced-choice  rating  scale  con- 
struction. Journal  of  Industrial  Psychology.  1963,  1(2),  55-58. 


Forced-Choice  Items,  Check-List 
Psych.  Abst . , 38,  #10.152 


2,  14 
A-H 


Bues,  H.  W.  The  construction  and  validation  of  a scale  to  measure  attitude 
toward any practice. Purdue University Studies for Higher Education XXVI,
1934, 31, 64-67.


Bureau  of  the  Census.  List  of  books  with  opinions  and/or  information 
relevant  to  questionnaire  design.  Washington,  D.  C.:  Bureau  of  the  Census 

RRB  Report  No.  63-52,  (64-415  (MRD)  ),  1965. 


Bureau of the Census. Questionnaire research reports -- Series A, Report 3:
Results of three classroom experiments with census of population housing
questionnaires. Washington, D. C.: Bureau of the Census, U. S. Department

of  Commerce,  Social  and  Economic  Statistics  Administration,  Statistical  Res. 
Div.  Response  Research  Staff  Report  No.  72-14. 


Buros, O. (Ed.) The mental measurement year books. Highland Park, New
Jersey: The Gryphon Press.




Burros,  R.  H.  The  estimation  of  the  discriminal  dispersion  in  the  method 
of successive intervals. Psychometrika, 1955, 20, 299-306.


Burton, A. C. On the science of field testing as applied to clothing and
personal  equipment.  Defense  Research  Board,  1948. 


* Burtt,  H.  E.,  & Gaskill,  H.  V.  Suggestibility  and  the  form  of  the  question. 
Journal  of  Applied  Psychology.  1932,  16(4),  358-373. 

Clarity,  Question  Stem  4,  3g 

Journal  of  Applied  Psychology.  16 , 373  (Rev.  from  rept.)  R-H 


Burwen,  L.  S.,  Campbell,  D.  T.,  & Kidd,  J.  The  use  of  a sentence  completion 
test  in  measuring  attitudes  toward  superiors  and  subordinates.  Journal  of 
Applied  Psychology,  1956,  248-250. 


Bush, L. E. Individual differences multidimensional scaling of adjectives
denoting feelings. Journal of Personality and Social Psychology, 1973, 25(1),
50-57.

Clarity, Adjectives 4, 9

Journal  of  Personality  and  Social  Psychology.  25(1),  p.  50  R-H 


* Buss,  A.  H.  The  effect  of  item  style  on  social  desirability  and  frequency 
of  endorsement.  Journal  of  Consulting  Psychology,  1959,  510-513. 


Response  Bias,  Question  Stem 

Journal  of  Consulting  Psychology.  23,  p.  512 


12,  3g 
R-H 


* Cahalan,  D.  Effectiveness  of  a mail  questionnaire  technique  in  the  army. 
Public  Opinion  Quarterly,  1951,  15,  575-578. 


Respondent's Motivation, Interviews  1, 11

Potter, Sharpe, Hendee and Clark, 1972  A-M




* Cahalan, D., Tamulonis, V., & Verner, H. M. Interviewer bias involved in
certain types of opinion survey questions. International Journal of Opinion
and Attitude Research, 1947, 1(1), 63-77.

Investigator Error, Response Alternatives, True-False Items,
Open-ended Items, Multiple Choice Items  2, 12

Bureau  of  Census  (7110001701)  A-H 


Caldwell,  L.  S.  Military  performance  - Physical  decrement  and  enhancement 
(DA  086071).  Fort  Knox,  Kentucky:  Medical  Research  Laboratory,  June  1973. 


N/A 

N/A 

N/A 

R-NA 

Calhoun,  R.  L.  Item  form  and  item  discriminating  power:  An  experimental 
study.  Dissertation  Abstracts,  1962,  ^(1),  335-336. 

Multiple  Choice  Items,  Question  Stem,  Achievement  Measures  3g 

Dissertation Abstracts, ^(1), pp. 335-336 (Rev.)  A-N


California  State  Department  of  Education,  Bureau  of  School  Planning. 

Profile  rating  wheel:  An  instrument  to  evaluate  school  facilities.  Revised 
edition  (ERIC  Document  Reproduction  Service,  ED  072  552).  Sacramento: 
California  State  Department  of  Education,  Bureau  of  School  Planning,  1972. 

Data  Analysis  8 

ERIC Document Reproduction Service, ED 072 552  A-N


Campbell,  A.  A.  Two  problems  in  the  use  of  the  open  question.  Journal 
of  Abnormal  and  Social  Psychology,  1945,  40(3),  340-343. 


Campbell, A. C. Some determinants of the difficulty of non-verbal classification
items. Educational and Psychological Measurement, 1961, 21, 899-913.


Campbell, D. P. Some desirable characteristics of interest inventories (ERIC
Document  Reproduction  Service,  ED  033  417).  Paper  presented  at  meeting  of 
American  Personnel  and  Guidance  Association,  Las  Vegas,  Nevada,  1969. 


Campbell, D. T. The indirect assessment of social attitudes. Psychological
Bulletin, 1950, 47(1), 15-18.


Campbell, D. T., & Fiske, D. W. Convergent and discriminant validation
by  the  multitrait-multimethod  matrix.  Psychological  Bulletin,  1959, 
81-105. 


Campbell, D. T., Lewis, N. A., & Hunt, W. A. Context effects with judgmental
language that is absolute, extensive, and extraexperimentally
anchored. Journal of Experimental Psychology, 1958, 220-228.

Response Alternatives, Raters  3a, 3f


Campbell,  D.  T.,  & Mohr,  P.  J.  The  effect  of  ordinal  position  upon  responses 
to  items  in  a check  list.  Journal  of  Applied  Psychology,  1950,  34,  62-67. 

Check  List,  Response  Alternatives  3b 

ORA  R-H 


Campbell, J. T., & Rundquist, E. A. Scale items for inclusion in forced-choice
rating forms. American Psychologist, 1950, 5, 280.


Campbell, D. T., Siegman, C. R., & Rees, M. B. Direction-of-wording effects
in the relationship between scales. Psychological Bulletin, 1967, 293-303.

Instrument Format, Response Bias  3g, 12

ORA  R-N


Campion,  J.  E.,  Gelfand,  N.  1.,  & Lassoff,  S.  Z.  Biasing  influence  of 
interviewer  expectations  on  interviewee  responses  in  a marketing  study. 
Catalog  of  Selected  Documents  in  Psychology,  1972,  2,  46-47. 


Cannell, C., & Fowler, F. Comparison of a self-enumerative procedure and
a personal interview: A validity study. Public Opinion Quarterly, 1963, 27,
250-264.


Cannell, C. G., Fowler, F. J., & Marquis, K. H. The influence of interviewer
and respondent psychological and behavioral variables on the reporting in
household interviews. Vital & Health Statistics, 1968, Series 2.


Cannell,  C.  F.,  & Kahn,  ?.  The  collection  of  data  by  interviewing.  In 
Festinger,  L.,  & Katz,  D.  (Eds.),  Research  methods  in  the  behavioral  sciences, 
pp. 327-380. New York: The Dryden Press, 1953.


Cannon, D., & Olson, H. C. SPANOCON: Span of Control. II. Effect on reliability
of free and forced distributions in rating. HumRRO Research Memorandum,
Subtask Spanocon, Task 11-28, 1961.

Data Analysis  8

Psych. Abst., 37, #5705  A-N

Canter, F. M., & Canter, A. N. Authoritarian attitudes and adjustment in
a military situation. United States Armed Forces Medical Journal, 1957, 8,
1201-1207.

Canter, R. R., Jr. A rating-scoring method for free-response data. Journal
of Applied Psychology, 1953, 37, 455-457.

Scoring  8

Psych. Abst., 29, #71  A-M

Cantrell, G. K. Multiple-choice item form in relation to age, intelligence,
and level of education. Brooks AFB, Texas: USAF School of Aerospace Medicine,
Aerospace Medical Division (AFSC), October 1969. SAM-TR-69-65.

N/A 

N/A 

N/A 

T-H 

* Cantril, H. Experiments in the wording of questions. Public Opinion Quarterly,
1940(a), 4, 330-332.

Attitude Measures, Clarity, Investigator Error  4, 13, 3g

Potter, Sharpe, Hendee & Clark, 1972 (Rev.)  R-M


Cantril,  H.  Problems  and  techniques:  Experiments  in  the  wording  of  questions. 
Public Opinion Quarterly, 1940(b), 4(2), 330-332.




Cantril, H. Gauging public opinion. Princeton, N. J.: Princeton University
Press, 1947.

Interviews, Textbook  13, 17

Bureau of Applied Social Research  A-H


Cantril, H., & Fried, E. The meaning of questions. In H. Cantril (Ed.),
Gauging Public Opinion. Princeton, N.J.: Princeton University Press, 1944.


Capron, V. L. Relative effect of three orders of arrangement of items upon
pupils' scores in certain arithmetic and spelling tests. Journal of Educational
Psychology, 1933, 687-694.


Carlson,  R.  E.  Effect  of  interview  information  in  altering  valid  impressions. 
Journal  of  Applied  Psychology,  1971,  ^(1),  66-72. 


er, J. B., & Dudycha, A. L. Effects of item format on item discrimination
and difficulty. Journal of Applied Psychology, 1973, ^(1), 116.


Carper,  J.,  & Doob,  L.  W.  Intervening  responses  between  questions  and 
answers  in  attitude  surveys.  Public  Opinion  Quarterly,  1953-54,  12,  511-519. 


Carroll,  S.  J.,  & Nash,  A.  N.  Effectiveness  of  a forced-choice  reference 
check.  Personnel  Administration,  1972,  15(2),  42-^6. 


Carter, H. D. Importance and significance of objective test items.
California Journal of Educational Research, 1955, 6, 61-71.

Achievement Measures  18

Psych. Abst., 29, #8992  A-N


* Carter, R. F., Ruggels, W. L., & Chaffee, S. H. The semantic differential
in opinion measurement. Public Opinion Quarterly, 1968, 32(4), 666-674.

Semantic Differential Items, Response Alternatives  3f, 14

ORA  R-H


Cartwright, D. Relation of decision-time to the categories of response.
American Journal of Psychology, 1941, 174-196.

Response Alternatives, Clarity  3a, 4

ORA  R-M


Casanova,  T.  The  measurement  of  randomness  in  test  items.  Journal  of 
Experimental  Education,  1944  (a),  196-183. 


Casanova,  T.  The  use  of  the  method  of  runs  for  testing  the  randomness  of 
the  order  of  examination  items.  Journal  of  Experimental  Education.  1944  (b)  , 
12,  165-168. 


* Cataldo, E. F., Johnson, R. M., & Kellstedt, L. A. Card sorting as a
technique for survey interviewing. Public Opinion Quarterly, 1970, 34(2),
202-215.

Card  Sorts,  Response  Bias  2,  12 

Bureau  of  Census,  #7122200202  A-H 


* Cattell, R. B. Psychological measurement: ipsative, normative, and interactive.
Psychological Review, 1944, 51, 292-303.

Questionnaire  Theory  and  Development,  Scaling  2,  14 

ORA  R-N 

Cattell,  R.  B.  Factor  analysis.  New  York:  Harper,  1952. 


Cattell, R. B., & Saunders, D. R. Inter-relationship and matching of personality
factors from behavior ratings, questionnaire and objective test
data.  Journal  of  Social  Psychology.  1950,  31,  243-260. 


* Cavan, R. S. The questionnaire in a sociological research project.
American  Journal  of  Sociology,  1933,  38(5),  721-727. 


Central  Statistical  Board.  A report  of  the  Central  Statistical  Board  on 
the  returns  made  by  the  public  to  the  Federal  Government.  House  Doc.  No.  27, 
76th  Congress,  1st  Session,  Washington,  D.  C.,  1939. 


N/A 

N/A 

N/A 

T-N 

* Champion, D. J., and Sear, A. M. Questionnaire response rate: A methodological
analysis. Social Forces, 1969, 335-339.

Instrument  Length,  Respondents'  Motivation  5,  11 

ORA  R-H 


Champney,  H.,  & Marshall,  H.  Optimal  refinement  of  the  rating  scale. 
Journal of Applied Psychology, 1939, 323-331.

Rating  Scales,  Response  Alternatives  3a 

ORA  R-M 


Chapanis, A. Research techniques in human engineering. Baltimore: The
Johns Hopkins Press, 1962.


Chapman,  L.  J.,  & Bock,  R.  D.  Components  of  variance  due  to  acquiescence 
and  content  in  the  F scale  measure  of  authoritarianism.  Psychological 
Bulletin,  1958,  328-333. 


Chapman,  L.  J.,  & Campbell,  D.  T.  Response  set  in  the  F scale.  Journal  of 
Abnormal  and  Social  Psychology,  1957,  54,  129-132. 


Chapman,  L.  J.,  & Campbell,  D.  T.  The  effect  of  acquiescence  response-set 
upon relationships among the F scale, ethnocentrism, and intelligence.
Sociometry, 1959, 153-161.


Chein, R. An introduction of sampling. Appendix B in C. Selltiz, M. J.,
Deutsch, M., and Cook, S. W. (Eds.) Research methods in social relations.
New York: Holt, 1959.




Chevan,  A.  Minimum-error  scalogram  analysis.  Public  Opinion  Quarterly, 
1972, 36(3), 379-387.


Child,  I.  L.  The  use  of  interview  data  in  qualifying  the  individual's  role 
in the group. Journal of Abnormal and Social Psychology, 1943, 38, 305-318.


Choo, T. Communicator-credibility and communication-discrepancy as determinants
of opinion change. Dissertation Abstracts, 1960, 21, 1246.


Choppin,  B.  H.,  & Purvis,  A.  C.  A comparison  of  open-ended  and  multiple- 
choice  items  dealing  with  literacy  understanding.  Research  in  the  Teaching 
of  English,  1969,  3(1),  15-24. 


Christal, R. E., & Madden, J. M. Effect of degree of familiarity in job
evaluation. USAF MADD Personnel Laboratory Technical Note No. 60-143, 1960.


Christie, R., Havel, J., & Seidenberg, B. Is the F-Scale irreversible?
Journal of Abnormal and Social Psychology, 1958, 56, 143-159.


* Clancy, K. J., & Carson, R. Why some scales predict better. Journal of
Advertising Research, 1970, 10(5), 33-38.

Paired Comparison Items, Rating Scales, Response Bias  2, 9, 12


* Clark, E. L. General response patterns to five-choice items. Journal of
Educational Psychology, 1956, 47, 110-117.

Multiple  Choice  Items,  Response  Bias,  Response  Alternatives  3a,  12,  3b,  7 
Psych.  Abst. , 31,  #8813  (Rev.  from  rept.)  R-H 




Clark,  E.  L.  Item  difficulties  based  on  end  segments.  Journal  of 
Educational  Psychology,  1957,  457-459. 


Achievement  Measures  18 

Psych. Abst., 33, #2024  A-N


Clark,  K.  E.,  & Kreidt,  P.  H.  An  application  of  Guttman's  new  scaling 
techniques  to  an  attitude  questionnaire.  Educational  and  Psychological 
Measurement, 1948, 8, 215-223.

Scaling,  Attitude  Measures  2,  14 
Psych.  Abst.,  23,  #3993  A-M 


Clark, R. A. The use of the Q-sort for collecting attitude data from company
commanders under field conditions. Paper for Western Psychological
Association  Meeting,  Berkeley,  Calif.,  March  1956. 


N/A 

N/A 

N/A 

T-M 

Clarke,  M.  A.  Arabic  distractors  for  English  vocabulary  tests.  English 
Language  Teaching,  1972,  ^(1),  73-76. 


Clausen, J. A., & Ford, R. N. Controlling bias in mail questionnaires.
Journal  of  American  Statistical  Association,  1947,  42,  497-511. 

Respondent's  Motivation  11 

ORA  R-M 


Cleary,  T.  A.,  & Hilton,  T.  L.  An  investigation  of  item  bias.  American 
Psychologist, 1964, 19, 506.


Clemans,  W.  V.  An  analytical  and  empirical  examination  of  some  properties 
of ipsative measure. Psychometrika, Monograph Supplement 14, 1965.

Scoring, Questionnaire Theory and Development  14


Cliff, N. The relation of adverb-adjective word combinations to their
components. Princeton, New Jersey: Educational Testing Service, 1955.
Research Memorandum 55-9.


N/A 

N/A 


N/A 


T-M 


* Cliff, N. Adverbs as multipliers. Psychological Review, 1959, 27-44.

Adjectives  6

ORA  R-M


Cliff, N. Multidimensional scaling and cognition.

judgments obtained under varying conditions. Los Angeles, Ca.: University
of  Southern  California,  1965.  Technical  Report  No.  1. 


N/A 


N/A 


N/A 


T-M 


Cliff,  N.  Multidimensional  scaling  and  cognition:  II.  The  relation  of 
evaluation to multidimensional meaning spaces. Los Angeles, Ca.: University
of  Southern  California,  1966  (a).  Technical  Report  No.  2. 


N/A 


N/A 


N/A 


T-M 


Cliff, N. Multidimensional scaling and cognition: III. Threat evaluation
and subjective organization of simulated raids. Los Angeles, Ca.: University
of Southern California, 1966(b). Technical Report No. 3.


N/A 

N/A 

N/A 

T-M 



Cliff, N. Multidimensional scaling and cognition: Final report. Los
Angeles, Ca.: University of Southern California, 1967. Final Report.

N/A  N/A 

N/A  T-M 

* Cliff, N. Adjective check list responses and individual differences in
perceived meaning. Educational and Psychological Measurement, 1968(a), 28(2),
1063-1077.

Response Bias, Personality Measures, Check List  12

Educational and Psychological Measurement, 28(2), p. 1076 (Rev. from rept.)  R-N

Cliff, N. The "idealized individual" interpretation of individual differences
in multidimensional scaling. Psychometrika, 1968(b), 33, 225-232.

Data  Analysis,  Questionnaire  Theory  and  Development  15 

Psychometrika, 33, p. 225  R-N


Cliff, N. Liking judgments and multidimensional scaling. Educational
and Psychological Measurement, 1969, 29(1), 87-98.

Scaling  14 

ORA  R-M 

Cliff,  N.  Scaling.  Annual  Review  of  Psychology,  1973,  24,  473-506. 

Cliff,  N.,  Pennell,  R.,  & Young,  F.  W.  Multidimensional  scaling  in  the 
study  of  set.  American  Psychologist,  1966,  707. 

Instrument  Format,  Investigator  Error,  Question  Stem  3c,  12 

American Psychologist, 21, p. 707  R-M


* Cloud,  J.,  & Vaughn,  G.  M.  Using  balanced  scales  to  control  acquiescence. 
Sociometry, 1970, 193-202.

Attitude  Measures,  Response  Bias,  Scoring,  Question  Stem  12,  8,  3g 

Bureau  of  Census,  #7111033602  (Rev.  from  Rept.)  R-H 


Coble,  J.  A.  Results  of  the  pilot  study  racial  match  experiment.  In 
Lansing,  J.  B.,  Withy,  S.  B.,  Wolfe,  A.  C.,  et  al.  (Eds.)  Working  papers  on 
survey research in poverty areas. Ann Arbor, Mich.: University of Michigan,
Institute for Social Research, Survey Research Center, 1971.


Cohen,  N.  E.  The  relativity  of  absolute  judgments.  American  Journal  of 
Psychology, 1937, 93-100.


Coffin, T. E. Some conditions of suggestion and suggestibility: A study
of some attitudinal and situational factors influencing the process of
suggestion. Psychological Monographs: General and Applied, 1941, No. 241.


Coffman,  W.  E.  Estimating  the  internal  consistency  of  a test  when  items 
are scored 2, 1, or 0. Educational and Psychological Measurement, 1952, 12,
392-393. 

Data  Analysis  18 

Psych. Abst., 27, #5515  A-N


* Cohen,  L.  Use  of  paired  comparison  analysis  to  increase  statistical  power 
of  ranked  data.  Journal  of  Marketing  Research,  1967,  4,  509. 

Data  Analysis,  Paired  Comparison  Items,  Ranking  2,  8 


* Cohen,  R.  The  position  effects  problem.  Public  Opinion  Quarterly,  1965, 
23,  456. 

Instrument  Format,  Interviews  3c 

Public Opinion Quarterly, 1965, ^(Fall), 456.  A-H




ERIC Document Reproduction Service, ED 011 037  A-N

Collins, G. On methods. Journal of Advertising Research, 1961, 1(3), 28-33.

Paired Comparison Items, Ranking, Rating Scales, Scaling  2, 17

ORA  R-N


Collins, W. A. Idiosyncratic verbal behavior of interviewers. Palo Alto,
Ca.: Stanford University, Institute for Communication Research, October, 1968.




* Coombs, C. H. Psychological scaling without a unit of measurement.
Psychological Review, 1950, 145-158.

Scaling  14

ORA  R-H


Coombs, C. H. A theory of psychological scaling. Ann Arbor, Mich.:
University  of  Michigan  Press,  1952. 


Coombs, C. H., Milhalland, J. E., & Womer, F. B. The assessment of partial
knowledge.  Educational  and  Psychological  Measurement,  1956,  16,  13-17. 


* Cooper,  A.,  & Cowen,  E.  L.  The  social  desirability  of  trait  descriptive 
terms:  A study  of  feeling  reactions  to  adjective  descriptions.  Journal  of 
Social  Psychology,  1962,  56,  207-215. 


Adjectives, Rating Scales, Response Bias  6, 12

Psych. Abst., #1223  A-H


Copeland, H. A. Studies in the reliability of personnel records. Journal
of  Applied  Psychology,  1953,  247-251. 


Corah,  N.  L.,  Feldman,  M.  J.,  Cohen,  I.  S.,  Gruen,  W.,  Meadow,  A.,  & Ringwall, 
E.  A.  Social  desirability  as  a variable  in  the  Edwards  Personal  Preference 
Schedule. Journal of Consulting Psychology, 1958, 22, 70-72.

Response Bias, Interest Measures  12

Psych. Abst., #5978  A-N


Corey,  L.  G.  How  to  isolate  product  attributes.  Journal  of  Advertising 
Research, 1970, 10(4), 41-44.

Data Analysis  8

ORA  R-N




* Corey,  S.  M.  Signed  versus  unsigned  attitude  questionnaires.  Journal  of 
Educational  Psychology,  1937,  28(2),  144-148. 

Anonymous  Respondent  11 

Potter, Sharpe, Hendee, and Clark, 1972  A-H


Costin, F. The optimal number of alternatives in multiple-choice achievement
tests: Some empirical evidence for a mathematical proof. Educational
and Psychological Measurement, 1970, 30(2), 353-8.


Costin, F. Three-choice versus four-choice items: implications for reliability
and validity of objective achievement tests. Educational and Psychological
Measurement, 1972, 32(4), 1035-8.


Coston, M. L., Thorne, H. W., & York, C. M. Study of bias in new equipment
training. Fort Benning, Georgia: U.S. Army Infantry Board, April,
1973. TECOM Project No. 9-CO-OOF-OOO-OiO, USAIB Project No. 3331.


Cotton,  J.  W.  Elementary  statistical  theory  for  behavior  scientists.  Palo 
Alto, Calif: Addison-Wesley, 1957.


* Couch, A., & Keniston, K. Yeasayers and naysayers: Agreeing response set
as a personality variable. Journal of Abnormal and Social Psychology, 1960,
151-174.

Attitude Measures, Response Bias  2, 12

Psych. Abst., 34, #7376  R-H


Coward, A. F., & Lord, F. M. Test reliability as a function of the number
of choices per item. Research Bulletin 50-47. Princeton, New Jersey:
Educational Testing Service, 1950.


Cox, W. E. Response patterns to mail surveys. Journal of Marketing
Research, 1966, 3, 392-7.




Cragun,  J.  R.,  & McCormick,  E.  J. 
scale reliabilities and scale interrelationships. USAF PRL Technical
Report,  No.  67-15,  1957. 


* Crawford, P. L. Comparison of two attitude scaling methods. Psychological
Reports, 1965, 17(3), 681-632.

Rating  Scales,  Paired  Comparison  Items  2 

Psych. Abst., 40, #3522  A-H


Creelman, M. B. The experimental investigation of meaning: A review of
the literature. New York: Springer, 1966.

N/A 

N/A 

N/A 

T-H 

Crehan, K. D., & Slakter, M. J. Note on comparison of paired-multiple-response
items and multiple-choice items. Psychological Reports, 1971, 28(1), 310.


Crespi, I. Use of a scaling technique in surveys. Journal of Marketing,
1961, 25(5), 69-72.


Crocker, L. M., & Mehrens, W. A. The comparative effectiveness of different
item analysis techniques in increasing change score reliability. Paper presented
at the Annual Meeting of the American Educational Research Association,
New  York,  New  York,  1971. 


Cronbach, L. J. Studies of acquiescence as a factor in the true-false
tests. Journal of Educational Psychology, 1942, 33, 401-415.


* Cronbach,  L.  J.  An  experimental  comparison  of  the  multiple  true-false  and 
multiple choice. Journal of Educational Psychology, 1941(a), 533-543.

Multiple Choice Items, True-False Items, Achievement Measures  2

Journal of Educational Psychology, 32, p. 541  R-H




Cronbach, L. J. The true-false test: A reply to Count Etoxinod. Education,
1941(b), 42, 59-61.


* Cronbach, L. J. Response set and test validity. Educational and Psychological
Measurement, 1946, 6, 475-494.

Response  Bias,  Multiple  Choice  Items  12 

ORA 


* Cronbach, L. J. Further evidence on response sets and test design.
Educational and Psychological Measurement, 1950, 10, 3-31.

Response Bias  12

ORA 


Cronbach, L. J. Coefficient alpha and the internal structure of tests.
Psychometrika, 1951, 16, 297-334.


Cronbach, L. J. Essentials of psychological testing. (3rd ed.) New York:
Harper  & Row,  1969. 

Textbook,  Bibliography  15,  16,  17 

ORA 


Cronbach, L. J., Gleser, G. C., Nanda, H., & Rajaratnam, N. The dependability
of behavioral measurements: Theory of generalizability for scores and
profiles. New York: John Wiley, 1972.


Cronbach, L. J., & Meehl, P. E. Construct validity in psychological tests.
Psychological  Bulletin,  1955,  281-302. 

Crosby,  R.  W.  Attitude  measurement  in  a bilingual  culture.  Journal  of 
Marketing  Research,  1969,  6,  421. 




Crosland, H. The psychological methods of word association and reaction
time as tests of deception. University of Oregon Publication, 1929, 1,
No. 1.


Cross, O. H. A study of faking on the Kuder Preference Record. Educational
and Psychological Measurement, 1950, 10, 271-277.


Crowne,  D.  P.,  & Marlowe,  D.  The  approval  motive:  Studies  in  evaluative 
dependence. New York: John Wiley, 1964.


Crutchfield, R. S., & Gordon, D. A. Variations in respondents' interpretations
of an opinion-poll question. International Journal of Opinion and
Attitude Research, 1947, 1(3), 1-12.

Attitude Measures, Clarity, Question Stem, Investigator Error  12, 4

Census  Abstracts,  71110071  A-N 


Cuber, J. F., & Gerberich, J. J. A note on consistency in questionnaire
responses. American Sociological Review, 1946, 11, 13-15.


Cummings, J. D. Naval personnel attitudes toward environmental pollution
(AD  901  680L) . San  Diego,  California:  Navy  Manpower  and  Material  Analysis 
Center  Pacific,  1971.  Report  No.  WSR-72-5. 

Attitude Measures  18
DDC  A-N 


Curtis,  F.  D.,  Darling,  W.  C.,  & Shearman,  N.  H.  A study  of  the  relative 
values  of  two  modifications  of  the  true-false  test.  Journal  of  Educational 
Research . 1943,  36,  517-527. 


Curtis, H. A., & Kropp, R. P. Experimental analyses of the effects of various
modes of item presentation on the scores and factorial content of tests administered
by visual and audiovisual means - A program of studies basic to television
testing. Tallahassee, Florida: Florida State University, School of
Education, 1961. Report No. NDEA-VIIA-3i35.




Dailey, J. G., and Hagenah, T. Vocational interest measurement: Theory
and practice. University of Minnesota Press, 1955.


* Dakin, R. E., & Tennant, D. Consistency of response by event-recall intervals
and characteristics of respondents. Sociological Quarterly, 1968,
73-84.


Investigator  Error 

Bureau  of  the  Census,  #7131907101 


Dalkey, N. C. Delphi (AD-650 554). Santa Monica, California: Rand Corporation,
1967. Report No. P-3704.


N/A 


N/A 


N/A 


T-M 


Dalkey, N. C. Experiments in group prediction (AD 668 107). Santa Monica,
California:  Rand  Corporation,  1968.  Report  No.  P-3320. 


Damarin, F., & Messick, S. J. Response styles as personality variables:
A theoretical integration of multivariate research. Research Bulletin 65-1.
Princeton,  New  Jersey:  Educational  Testing  Service,  1965. 

N/A 
T-M 


N/A 

N/A 


Danielson, W. A. A data reduction method for scaling dichotomous items.
Public Opinion Quarterly, 1957, 21, 377-379.


Darling,  A.  B.,  & Bragdon,  H.  W.  Essay  vs.  objective  testing  in  social 
studies. College Board Review, 1952, J^, 260-263.

Multiple Choice Items, Open-ended Items  2

Psych. Abst., 27, #663




Darnell, D. K. A technique for determining the evaluative discrimination
capacity and polarity of semantic differential scales for specific concepts.
Dissertation Abstracts, 1964, 25(2), 2623-2624.


Davis, R. J. Considerations for evaluating evaluation instruments. Ill.
Teacher Contemp. Roles, 1970, 232-284.


Dawes, R. M. Fundamentals of attitude measurement. New York: John Wiley,
1972.

Textbook  17

ORA  T-H


Dawis, R. V. The measurement of employee attitudes (AD 720 712). Minneapolis,
Minnesota: Center for the Study of Organizational Performance and Human
Effectiveness, 1971. Report No. TK-3001.


Dawis,  R.  V.,  & Tinsley,  H.  E.  A.  The  equivalence  of  semantic  and  figural 
test  presentation  of  the  same  items.  Minneapolis,  Minn.:  Minnesota  University 
Center for the Study of Organizational Performance and Human Effectiveness.
Report  No.  TR-3004. 


Day, R. L. Systematic paired comparisons in preference analysis. Journal
of Marketing Research, 1965, 2, 406.

Paired  Comparison  Items,  Preference  Measures  2 

Journal  of  Marketing  Research,  1965,  p.  406  (Rev.)  A-M 


Day, R. L. Methods of estimating consumer preference distributions.
California  Management  Review.  1967,  9,  35-42. 


Dean,  J.  P.,  & Whyte,  W.  F.  How  do  you  know  if  the  informant  is  telling 
the truth. Human Organization, 1958, 18(4), 21-23.




DeFleur, M. L., & Catton, W. R., Jr. The limits of determinacy in
attitude measurement. Social Forces, 1957, 35, 295-300.


DeGreene,  K. 


psychology. New York: McGraw-Hill, 1970.


S^ff^Lc.,  1951,  ^(492),  25i:  IXbstract  of  Master  s thesis.) 

Deming, W. E. On errors in surveys. American Sociological Review, 1944,
9, 359-369.

Deming, W. E. On the distinction between enumerative and analytic surveys.
Journal American Statistical Association, 1953, 244


Deri, S., Dinnerstein, D., Harding, J., & Pepitone, A. D. Techniques for
the diagnosis and measurement of intergroup attitudes and behavior.
Psychological Bulletin, 1948, 248-271.


Attitude  Measures 


Psych. Abst., 23, #700 (Rev. from rept.)


R-N 


DeSoto, C. B. The predilection for single orderings. Journal of Abnormal
and Social Psychology, 1961, 62, 16-23.


Dexter, L. A. Role relationships and conceptions of neutrality in interviewing.
American Journal of Sociology, 1956, 62(2), 153-157.


a T T ^ Fvans  V J An  investigation  of  the  cognitive  correlates 
Journal of Educational Measurement, 1972, 9(2).




Diers,  C.  J.  Social  desirability  and  acquiescence  in  response  to  personality 
items. Dissertation Abstracts, 1961, 22(5), 1709.


Response  Bias,  Personality  Measures  12 

Dissertation Abstracts, 22, p. 1709 (Rev.)  A-H


Dillehay,  R.  C.,  & Jernigan,  L.  R.  The  biased  questionnaire  as  an  instrument 
of opinion change. Journal of Personality and Social Psychology, 1970, 15,
144-155.

Attitude Measures, Investigator Error  10

Journal of Personality and Social Psychology, 15, p. 144 (Rev.)  R-H


Divesta, F. J., & Walls, R. T. Response selection as a function of instructions
and motivation under nonreinforcement conditions. Journal of Experimental
Psychology, 1967, 73(3), 365-373.

Paired Comparison Items, Respondent's Motivation  11

Journal of Experimental Psychology, 73, p. 365  R-NA


Dixon,  T.  R.  Experimenter  approval,  social  desirability  and  statements  of 
self-reference. Journal of Consulting and Clinical Psychology, 1970, 35,
400-405.

Respondent's Motivation, Response Bias  11, 12

Journal of Consulting and Clinical Psychology, 35, p. 400 (Rev. from rept.)  R-N


Dizney, H. F., Merrifield, P. R., & Davis, O. L., Jr. Effects of answer-
sheet format on arithmetic test scores. Educational and Psychological
Measurement, 1966, 26(2), 491-493.

Instrument Format, Achievement Measures  3g

Psych. Abst., 40, #13532  A-M




* Dodd, S. C., & Gerberick, T. R. Word scales for degrees of opinion.
Language and Speech, 1960, 18-31.

Response  Alternatives,  Adjectives 

Psych.  Abst. , 3^,  #6315  ^ ^ 


* Dohrenwend, B. S. Some effects of open and closed questions on respondent's
answers. Human Organization, 1965, 24(2), 175-184.

Closed-Ended Items, Open-Ended Items  1, 3g


Dohrenwend,  B.  S.  An  experimental  study  of  directive  interviewing. 
Public Opinion Quarterly, 1970, 34(1), 117-125.


Dohrenwend,  B.  S.  An  experimental  study  of  payments  to  respondents. 
The Public Opinion Quarterly, Winter 1970-71, 34(4), 621-624.


N/A 


N/A 


Dohrenwend, B. S., Colombotos, J., & Dohrenwend, B. P. Social distance
and interviewer effects. Public Opinion Quarterly, 1968, 32(3), 410-422.


Dohrenwend, B. S., & Richardson, S. A. A use for leading questions in
research interviewing. Journal of Marketing, 1940, 122-124.

Question Stem  3c

Bureau  of  Census,  #7111006801  A-N 


Dohrenwend,  B.  S.,  Williams,  J.  A.,  & Weiss,  C.  H.  Interviewer  biasing 
effects: Toward a reconciliation of findings. Public Opinion Quarterly,
1969,  33(1),  121-129. 




Doll,  R.  E.  Item  susceptibility  to  attempted  faking  as  related  to  Item 
characteristic and adopted fake set (AD 740 290). San Diego, California:
Navy  Medical  Neuropsychiatric  Unit,  1970.  Report  No.  NMNRU-69-33. 


N/A 

N/A 

N/A 

T-H 

Dollard,  J.  Under  what  conditions  do  opinions  predict  behavior?  Public 
Opinion  Quarterly,  1948-49,  1^,  623-632. 


Dolliver,  R.  H.,  & Clark,  J.  A.  Status  faking  on  the  SVIB-M.  Journal  of 
Vocational  Behavior,  1972,  2^(1),  47-55. 


Donald,  M.  N.  Implications  of  nonresponse  for  the  interpretation  of  mail 
questionnaire  data.  Public  Opinion  Quarterly.  1960,  99-114. 


Doncel, J. F., Alimena, B. S., & Birch, C. M. Influence of prestige suggestion
on the answers of a personality inventory. Journal of Applied
Psychology. 1949, 33, 352-355.

Personality  Measures,  Rating  Scales,  Response  Bias  10,  12 

ORA  R-M


Downie, N. M. Fundamentals of measurement. (2nd ed.) Oxford University
Press,  1967. 

Draguns, J. G. Response sets on the MMPI and in structuring ambiguous
stimuli.  Psychological  Reports.  1963,  1^(3),  823-828. 

Response  Bias,  Personality  Measures  12 

Psych.  Abst.,  #8484  (Rev.)  A-N 


Drayton,  L.  E.  Bias  arising  in  wording  consumer  questions.  Journal  of 
Marketing . 1954,  _1J^,  140-145. 




Dressel,  P.  L.,  & Schmid,  J.  Some  modifications  of  the  multiple-choice 
item.  Educational  and  Psychological  Measurement,  1953,  574-595. 


Drinkwater,  B.  L.  A comparison  of  the  direction-of-perception  techniques 
with  the  Likert  method  in  the  measurement  of  attitudes.  Journal  of  Social 
Psychology . 1965,  ^(2),  189-196. 

Response  Alternatives,  Scales,  Reliability  2,  3f 
Psych.  Abst.,  #4747  (Rev.  from  rept.)  R-H 


Driver,  R.  A.  The  validity  and  reliability  of  ratings.  Personnel . 1941, 
17,  185-191. 


Druckman,  D.,  & Dunning,  M.  C.  Double  agreement  of  reversed  item  attitude 
scales: Agreement set versus social desirability. Psychonomic Bulletin,
1967, 1(2), 19.


* Dubeck,  J.  A.,  et  al.  Falsification  of  the  forced  guilt  inventory. 
Journal  of  Consulting  and  Clinical  Psychology,  1971,  ^(2),  296. 

Response  Bias  12 

ERIC  Document  Reproduction  Service,  EJ  040  673  A-M 


* Dubois,  C.  The  card-sorting  or  psychophysical  interview.  Public  Opinion 
Quarterly , 1949-50,  1^(4),  619-628. 

Investigator Error, Card Sorts, Response Bias, Respondent's  2, 11, 12, 13
Motivation

ORA  R-N 


DuBois,  P.,  Loeving,  J.,  & Smith,  T.  L.,  Jr.  Evaluation  for  methods  of 
keying  psychological  tests  for  prediction  of  external  criteria.  USAF 
Personnel  and  Training  Center,  Research  Reproduction  No.  AFPTCR-TN-56-65 , 
1956. 

Question  Stem  3g 

Psych. Abst., 31, #6907  A-N




Dudycha, G. J. A note on the "halo effect" in ratings. Journal of Social
Psychology. 1942, 1^, 331-333.

Dunlap,  J.  W.  Problems  arising  from  the  use  of  a separate  answer  sheet. 
Journal  of  Psychology.  1940,  j^,  3-48. 

Instrument  Format  2,  3e,  14,  3g 

Report  summary  R-H 


Dunlap, J. M., DeMello, A., & Cureton, E. E. The effects of different
directions  and  scoring  methods  on  the  reliability  of  a true-false  test. 
School  and  Society,  1929,  3^,  378-382. 


Dunn, T. F., Goldstein, L. G., & Berkhouse, R. G. Effect of item construction
principles on difficulty, reliability, and validity. USA TAGO Personnel
Research  Branch,  Technical  Research  Note  64,  1956. 

Achievement  Measures  18 

Psych.  Abst. , 3^,  #2779  A-N 


Dunnette,  M.  D.  Accuracy  of  students'  reported  honor  point  averages. 
Journal  of  Applied  Psychology,  1952,  20-22. 


Dunnette, M. D., Aylward, M., & Uphoff, M. H. The effect of lack of information
on the undecided response in attitude surveys. Journal of Applied
Psychology. 1956, 40(3), 150-153.

Response  Alternatives,  Clarity  3a,  4 

ORA  R-H 


Dunnette,  M.  D.,  & Heneman,  H.  G.,  Jr.  Influence  of  scale  administrator  on 
employee  attitude  responses.  Journal  of  Applied  Psychology.  1956,  73-77. 

Anonymous Respondent, Investigator Error  3g, 11, 9

Journal of Applied Psychology. 40, p. 77 (Rev. from rept.)  R-H


Dunn-Rankin,  P.  The  true  probability  distribution  of  the  range  of  rank 
totals  and  its  application  to  psychological  scaling.  Dissertation  Abstracts, 
1966,  ^(8),  4827. 

Durant,  H.,  & Maas,  I.  Who  doesn't  answer?  British  Psychological  and 
Sociological  Bulletin,  1956,  33-34. 

Instrument  Length,  Respondent's  Motivation  5,  11 

Potter,  Sharpe,  Hendee  and  Clark,  1972  A-M 


Durbin,  J.,  & Stuart,  A.  Differences  in  response  rates  of  experienced 
and inexperienced interviewers. London: London School of Economics and
Political  Science,  Survey  Research  Centre,  n.d. 


Dyer, R. F., Klein, R. D., & Yudowitch, K. L. Analysis of alternate forms
of a VOLAR/MVA questionnaire. Palo Alto: Operations Research Associates,
1974. (Prepared for the Army Research Institute for the Behavioral and
Social Sciences, Fort Hood, Texas under Contract DAHC19-74-C-0032.)

Question  Stems,  Response  Alternatives  3b,  3c 

ORA 


Dyer,  R.  F.,  Matthews,  J.  J.,  Wright,  C.  E.  & Yudowitch,  K.  L.  Questionnaire 
construction manual. Palo Alto: Operations Research Associates, 1975.

Questionnaire  Theory  and  Development  14,  17 

ORA  R-H 

Ebel, R. L. Estimation of the reliability of ratings. Psychometrika, 1951,
16, 407-424.

Ebel, R. L. Measuring educational achievement. New Jersey: Prentice-Hall, 1965.

Ebel, R. L. Expected reliability as a function of choices per item. Educational
and Psychological Measurement. 1969, ^(3), 565-570.


Ebel, R. L. The test for true-false test items. School Review. 1970, 78(3),
363-389.




Ebel, R. L. The comparative effectiveness of true-false and multiple choice
achievement test items (ERIC Document Reproduction Service, ED 050 148).
Paper presented at the Annual Meeting of the American Educational Research
Association, New York, 1971 (a).

Achievement  Measures  18 

ERIC  Document  Reproduction  Service,  ED  050  148  A-N 


Ebel, R. L. How to write true-false test items. Educational and Psychological
Measurement, 1971 (b), ^(2), 417-426.


Ebel, R. L. Why is a longer test usually a more reliable test? Educational
and Psychological Measurement, 1972, ^2(2), 249-253.

Achievement  Measures,  Instrument  Length  5 

ERIC  Document  Reproduction  Service,  EJ  061  302  A-N 

Ebel,  R.  L.  Test  development,  interpretation,  and  use.  TM  Report  19 . 
Princeton,  New  Jersey:  Educational  Testing  Service,  1973. 

Literature  Review  16 

ORA  R-H 


Echternacht,  G.  J.,  et  al.  An  evaluation  of  the  feasibility  of  confidence 
testing as a diagnostic aid in technical training (ERIC Document Reproduction
Service, ED 058 318). Princeton, New Jersey: Educational Testing
Service,  1971. 


Eckland,  B.  K.  Effects  of  prodding  to  increase  mail-back  returns.  Journal 
of  Applied  Psychology,  1965,^  (3), 165-169. 


Edgerton,  H.  A.,  Britt,  S.  H.,  & Norman,  R.  D.  Objective  differences  among 
various types of respondents to a mailed questionnaire. American Sociological
Review,  1947,  1^,  435-444. 


Edgerton, H. A., & Kolbe, L. E. The method of minimum variation for the
combination  of  criteria.  Psychometrika . 1936,  1^,  183-187. 




* Edrich, H. The effects of context and of raters' attitudes on judgments of
favorableness of statements about a social group (AD 653 916). Boulder,
Colorado: University of Colorado, 1965. Project AF-9778. Institute of
Behavioral Science.

Question  Stem  3g 

DDC, #AD 653 916  A-M


Educational  Testing  Service.  Multiple  choice  questions:  A close  look 
(ERIC  Document  Reproduction  Service,  ED  081  783).  Princeton,  New  Jersey: 
Educational  Testing  Service,  1973. 

Achievement  Measures  18 

ERIC  Document  Reproduction  Service,  ED  081  783  A-N 


Edwards,  A.  L.  Assessing  opinion  and  discovering  facts.  Psychology  for 
the  Armed  Forces.  Washington,  D.  C.,  Infantry  Journal  Press,  1945.  Chapter  21. 


Edwards,  A.  L.  A comparison  of  the  Thurstone  and  Likert  methods  of  attitude 
scale  construction.  Journal  of  Applied  Psychology.  1946  (a),  ^(1),  72-83. 


* Edwards,  A.  L.  A critique  of  "neutral  items"  in  attitude  scales  constructed 
by  the  method  of  equal-appearing  intervals.  Psychological  Review.  1946  (b) , 
159-169. 

Reliability,  Scaling  2,  8 

ORA  R-M 


Edwards,  A.  L.  On  Guttman's  scale  analysis.  Educational  and  Psychological 
Measurement , 1949,  8,  313-318. 


Scaling  14 

ORA  R-H 

* Edwards,  A.  L.  Psychological  scaling  by  means  of  successive  intervals. 
Psychometric  Laboratory:  University  of  Chicago,  1951,  M,  1-13. 


Scaling  14 

ORA  R-H 


Edwards, A. L. The scaling of stimuli by the method of successive intervals.
Journal of Applied Psychology. 1952, 118-122.




Edwards,  A.  L.  The  relationship  between  the  judged  desirability  of  a trait 
and  the  probability  that  the  trait  will  be  endorsed.  Journal  of  Applied 
Psychology.  1953,  37(2),  90-93. 


Edwards, A. L. Experiments: Their planning and execution. In Lindzey,
G. (Ed.), Handbook of social psychology. Cambridge, Mass.: Addison-Wesley,
1954.


Reliability,  Scaling 


* Edwards,  A.  L.  Social  desirability  and  Q sorts.  Journal  of  Consulting 
Psychology . 1955,  1^,  462. 

Card  Sorts,  Response  Bias  12 


* Edwards, A. L. A technique for increasing the reproducibility of cumulative
attitude scale. Journal of Applied Psychology. 1956, 263-265.

Attitude  Measures,  Paired  Comparison  Items,  Rating  Scales,  2,  14 
Scaling 

Psych. Abst., 31, #5916 (Rev. from rept.)  R-H


* Edwards,  A.  L.  Social  desirability  and  probability  of  endorsement  of  items 
in  the  interpersonal  check  list.  Journal  of  Abnormal  and  Social  Psychology. 
1957 (a), 394-396.


Anonymous Respondent, Response Bias  11, 12

Journal of Abnormal and Social Psychology. 55, p. 396


Edwards,  A.  L.  The  social  desirability  variable  in  personality  assessment 
and  research.  New  York:  Dryden  Press,  1957  (b) . 


Edwards,  A.  L.  Techniques  of  attitude  scale  construction.  New  York: 
Appleton-Century-Crofts, 1957 (c).




Edwards, A. L. Statistical analysis. (Rev. ed.) New York: Holt, Rinehart,
and  Winston,  1960. 


Edwards,  A.  L.  Correlations  between  scores  on  personality  scales  when 
items are stated in the first and third person form. Educational and
Psychological Measurement, 1969, ^(3), 561-563.


Edwards,  A.  L.,  & Abbott,  R.  D.  Measurement  of  personality  traits:  Theory 
and  technique.  Annual  Review  of  Psychology,  1973,.^,  241-278. 

Literature  Review  16 

PASAR,  50,  #03043  R-NA 


Edwards, A. L., & Diers, C. J. Social desirability and the factorial
interpretation of the MMPI. Educational and Psychological Measurement, 1962, 22,
501-509.


* Edwards, A. L., & Diers, C. J. Neutral items as a measure of acquiescence.
Educational and Psychological Measurement, 1963, 23(4), 687-698.

Response  Bias,  Personality  Measures  12 

Psych. Abst., 38, #8486  A-H


Edwards, A. L., Diers, C. J., & Walker, J. N. Response sets and factor
loadings  on  sixty-one  personality  scales.  Journal  of  Applied  Psychology, 
1962,  220-225. 


* Edwards, A. L., & Horst, P. Social desirability as variable in Q technique
studies. Educational and Psychological Measurement, 1953, 13, 620-625.

Response Bias, Card Sorts  12

Educational and Psychological Measurement, 13, p. 625  R-H


Edwards, A. L., & Kenny, K. C. A comparison of the Thurstone and Likert
methods  of  attitude  scale  construction.  Journal  of  Applied  Psychology.  1946, 
72-83. 


Attitude  Measures,  Reliability,  Scaling  2,  8,  14 

ORA  R-H 


Edwards,  A.  L.,  & Kilpatrick,  F.  P.  Scale  analysis  and  the  measurement  of 
social attitudes. Psychometrika. 1948 (a), 1^, 99-114.

Edwards,  A.  L.,  & Kilpatrick,  F.  P.  A technique  for  the  construction  of 
attitude  scales.  Journal  of  Applied  Psychology.  1948  (b) , 32,  374-384. 

Attitude  Measures,  Scaling  2,  14 

ORA  R-H 


Edwards, A. L., & Thurstone, L. L. An internal consistency check for scale
values determined by the method of successive intervals. Psychometrika. 1952,
17,  169-180. 


Edwards,  A.  L.,  & Walsh,  J.  A.  Response  sets  in  standard  and  experimental 
personality scales. American Educational Research Journal. 1964, 1(1), 52-61.


Ehrlich, J. S., & Riesman, D. Age and authority in the interview. Public
Opinion Quarterly, 1961, ^(1), 39-56.

Investigator  Error  9 

Bauman,  Rogers,  and  Weiss,  1971  (Rev.)  A-M 


Eisenberg, P. Two methods of combining attitudes of like, indifference,
and dislike, into one score. Journal of Applied Psychology, 1945, 29,
246-251.


Eisenberg, T. Fakability and validity of two types of forced-choice formats.
Dissertation Abstracts, 1965, 26(3), 1790-1791.

Response Bias, Forced Choice Items  2, 12

Dissertation Abstracts, 26(3), pp. 1790-1791 (Rev.)  A-H


Eisenman, R., & Rappaport, J. Complexity preference and semantic differential
ratings of complexity-simplicity and symmetry-asymmetry. Psychonomic
Science.  1967,  7(4),  147-148. 


Eisler,  H.  The  connection  between  magnitude  and  discrimination  scales  and 
direct  and  indirect  scaling  methods.  Psychometrika . 1965,  30(3),  271-289. 

Scaling, Response Alternatives, Data Analysis,  8
Paired Comparison Items

Psych. Abst., 40, #2091  A-M


Ekehammar, B., & Magnusson, D. Subjective confidence and interjudge agreement
as functions of amount of information: A study of interview data.
Stockholm: Psychological Laboratories, Univ. of Stockholm, 1972, No. 366.


Ekman,  G.  Scales  of  Conservatism  (AD  688  342).  Stockholm,  Sweden: 
Stockholm University, Psychological Laboratories, January 1963.


Ekman,  G.,  & Kunnapas,  T.  Note  on  direct  and  indirect  scaling  methods. 
Psychological Reports. 1960, 174.

Rating  Scales  2 

Psych.  Abst. , 35 , #5483  A-H 


Ekman, G., & Sjoberg, L. Scaling. Annual Review of Psychology. 1965, 16,
451-474.




Elashoff, J. D., & Spiegel, D. E. Optimal choice of rater teams: II.
Applications. Psychometrika, 1969, 3^(1), 33-44.


Elinson,  J.  Attitude  research  in  the  Army.  Journal  of  Applied  Psychology, 
1949,  33,  1-5. 

Military  Personnel,  Attitude  Measures  15 

Psych. Abst., 23, #3688  A-N


Elinson,  J.,  & Cisin,  I.  H.  Detection  of  interviewer  cheating  through 
scale  technique.  Public  Opinion  Quarterly,  1948,  12 . 


Elinson, J., & Haines, V. T. Role of anonymity in attitude surveys. American
Psychologist , 1950,  315. 


* Ellenbogen, B. L., & Danley, R. A. Comparability of responses to a socially
concordant question: "Open-end" and "closed". Journal of Health and Human
Behavior, 1962, 3(2), 136-140.

Closed-Ended Items, Open-Ended Items  1, 2

Bureau of the Census, #7110004901  A-H


Elliot,  F.  R.  Eye  vs.  ear  in  moulding  opinion.  Public  Opinion  Quarterly. 
1936,  1,  83-87. 


Elliot, L. L. Factorial structure of airman self-rating and their relationship
to peer nominations (AD 242 388). Wright Patterson AFB, Ohio: Aeronautical
Systems Division, 1960.


N/A 

N/A 


N/A 

R-NA 




* Elliot,  L.  L.  Effects  of  item  construction  and  respondent  attitude  on 
response  acquiescence.  Educational  and  Psychological  Measurement.  1961, 
21,  405-415. 


Response Bias, Question Stem, Military Personnel  12, 9
Psych. Abst., 36, #2H FOSE


Ellis,  A.  Personality  questionnaires.  Review  of  Educational  Research. 
1947 (a), 17, 53-63.


* Ellis, A. Questionnaire versus interview methods in the study of human
love relationships. American Sociological Review, 1947 (b), 541-553.

Interviews  1 

Psych.  Abst . , 23 , #4196  A-H 


* Ellis,  A.  Questionnaire  versus  interview  methods  in  the  study  of  human 
love  relationships.  II.  Uncategorized  responses.  American  Sociological 
Review , 1948,  _0,  61-65. 

Interviews  1 

Potter,  Sharpe,  Hendee  and  Clark,  1972  A-M 


Ellis, R. A., Endo, C. M., & Armer, M. The use of potential nonrespondents
for  studying  nonresponse  bias  (AD-708  207).  Eugene,  Oregon:  Oregon  Univ., 
Dept,  of  Sociology,  1970.  Contract  No.  AF-AFOSR-1582-68,  PH-MH-15735. 


Ellis, A., & Gerberich, J. R. Interests and attitudes. Review of Educational
Research, 1947, 17, 64-77.


Elster,  R.  S.,  & Githens,  W.  H.  Preferences  for  senior  naval  officers' 
billets  obtained  by  using  four  different  psychological  scaling  techniques. 
Proceedings of the Annual Convention of the American Psychological
Association, 1972, 7(Pt. 1), 21-22.




Elster,  R.  S.,  & Githens,  W.  H.  Development  of  a man-to-man  rating  scale 
for  evaluating  performance.  Proceedings  of  the  81st  Annual  Convention  of  the 
American  Psychological  Association.  Montreal,  Canada,  1973,  8,  761-762. 


* Eng,  E.,  & French,  R.  L.  The  determination  of  sociometric  status. 
Sociometry, 1948, 368-371.


Paired  Comparison  Items,  Rating  Scales,  Scaling 
Psych.  Abst. , 25 , #7384 


Engel,  J.  F.  Tape  recorders  in  consumer  research.  Journal  of  Marketing. 
1962,  ^(2),  73-74. 


Interviews, Projective Items, Open-Ended Items  3g, 15


Engelhart,  M.  D.  Suggestions  for  writing  achievement  exercises  to  be  used 
in tests scored on the electric scoring machine. Educational and
Psychological Measurement. 1947, _7.


Engelhart,  M.  D.  A comparison  of  several  item  discrimination  indices. 
Journal  of  Educational  Measurement,  1965,  2(1),  69-76. 

Achievement  Measures,  Data  Analysis  8 


Psych.  Abst.,  39,  #15229 


Engels,  J.  F.,  & Wales,  H.  G.  Spoken  versus  pictured  questions  on  taboo 
topics. Journal of Advertising Research. 1962, 2(1), 11-17.

Instrument  Format  3g 

Bureau  of  Census,  #7111011601  A-M 


* England,  L.  R.  Capital  punishment  and  open-end  questions.  Public  Opinion 
Quarterly . 1948,  12,  412-416. 

Open-Ended  Items,  Close-Ended  Items,  Response  Alternatives  2 




Engvik, H., Kvale, S., & Havik, O. E. Rater reliability in evaluation of
essay  and  oral  examinations.  Pedagogisk  Forskning:  Scandinavian  Journal  of 
Educational  Research.  1970,  4,  195-220. 


Erdos,  P.  L.  How  to  save  time  and  money  on  the  tabulation  of  surveys. 
Printers'  Ink.  1948  (a),  ^(7),  36-37. 


Erdos,  P.  L.  Planning  the  questionnaire  for  tabulation.  International 
Journal of Opinion and Attitude Research, 1948 (b), 2(3), 401-408.


Instrument Format, Scoring  8

Bureau of Census, #7110002001 (Rev. from rept.)  R-M


Erdos,  P.  L.,  & Morgan,  A.  J.  Professional  mail  surveys.  New  York: 
McGraw-Hill,  1970. 


ERIC Clearinghouse on Tests, Measurements, and Evaluation. Test bias:
A bibliography (ERIC Document Reproduction Service, ED 051 312). Princeton,
New Jersey: ERIC Clearinghouse on Tests, Measurements, and Evaluation, 1971.

Response  Bias,  Investigator  Error,  Bibliography  12,  16 

ORA  R-M


Eriksen,  C.  W.,  & Hake,  H.  W.  Absolute  judgments  as  a function  of  stimulus 
range  and  number  of  stimulus  and  response  categories.  Journal  of  Experimental 
Psychology , 1955,  323-332. 


Ericksen,  S.  C.  A skeptical  note  on  the  use  of  attitude  scales  toward  war: 
I.  In  1940,  1941.  Journal  of  Social  Psychology,  1942,  1^,  229-242. 


Evans,  F.  R.,  & Reilly,  R.  A study  of  speededness  as  a source  of  test  bias 
(ERIC  Document  Reproduction  Service,  ED  053  207).  Princeton,  New  Jersey: 
Educational  Testing  Service,  1971. 




* Eysenck, H. J. Response set, authoritarianism and personality questionnaires.
British Journal of Social and Clinical Psychology. 1962, ^(1), 20-24.


Response Bias, Attitude Measures  12

Psych. Abst., 37, #1252  A-N


* Eysenck,  H.  J.,  & Crown,  S.  An  experimental  study  in  opinion-attitude 
methodology.  International  Journal  of  Opinion  and  Attitude  Research.  1949, 
3,  47-86. 

Questionnaire Theory and Development, Scaling  2, 14

International  Journal  of  Opinion  and  Attitude  Research.  R-M 

3,  p.  47-86  (Rev.  from  rept.) 


* Eysenck, H. J., & Eysenck, S. B. G. A factorial study of an
interview-questionnaire. Journal of Clinical Psychology, 1962, 18(3), 286-290.

Interviews,  Personality  Measures  1,  7 

Psych.  Abst . , 39,  #175  A-H 


* Eysenck, S. B. G., & Eysenck, H. J. Acquiescent response set in personality
questionnaires. Life Sciences, 1963 (a), No. 2, 144-147.

Response Bias, Personality Measures  2, 12

Psych. Abst., 38, #995  A-M


Eysenck, S. B. G., & Eysenck, H. J. An experimental investigation of
"desirability"  response  set  in  a personality  questionnaire.  Life  Sciences. 
1963  (b).  No.  5,  343-355. 

Response  Bias,  Personality  Measures  12 

Psych. Abst., 38, #2712  A-N




F 


* Faerber,  N.  N.  An  experimental  study  of  the  relative  difficulty  of  hand- 
scored  versions  of  the  same  test.  Bulletin  of  the  National  Institute  of 
Personnel Research, Johannesburg, 1951, 3(2), 10-19.

Instrument  Format,  Multiple  Choice  Items,  Open-ended  Items  2,  3g,  14 
Psych.  Abst.,  #772  A-M 


* Falk,  G.  H.,  & Bayroff,  A.  G.  Rater  and  technique  contamination  in  criterion 
ratings.  Journal  of  Applied  Psychology,  1954,  38,  100-102. 

Raters,  Investigator  Error  13 

Psych. Abst., 29, #3125  A-H


Falthzik,  A.  M. , & Carroll,  S.  J.  Rate  of  return  for  closed  versus  open 
ended  questions  in  a mail  questionnaire  survey  industrial  organization. 
Psychological  Reports,  1971,  ^ (3,  Pt.  2),  1121-1122. 


* Falthzik,  A.  M.,  & Jolson,  M.  A.  Statement  polarity  in  attitude  studies. 
Journal of Marketing Research, 1974, 11, 102-105.

Instrument  Format,  Question  Stem,  Response  Bias  3g,  12 

ORA 


Farley,  F.  H.  Global  self-ratings,  the  independence  of  questionnaire 
drive  and  anxiety,  and  social  desirability  variance.  Acta  Psychologica , 
Amsterdam,  1968,  ^(4),  387-397. 


Farnsworth,  P.  R.  Shifts  in  the  values  of  opinion  items.  Journal  of 
Psychology . 1943,  1^,  125-128. 


* Farnsworth, P. R. Attitude scale construction and the method of
equal-appearing intervals. Journal of Psychology, 1945 (a), 245-248.

Attitude Measures, Scaling, Investigator Error, Rating  4, 14
Scales, Clarity




* Farnsworth, P. R. Further data on the obtaining of Thurstone scale values.
Journal  of  Psychology.  1945  (b) , 19,  69-73. 


Scaling  14

Psych. Abst., 19, #1281  R-H


Fear, R. A. The evaluation interview: Predicting job performance in business
and industry. New York: McGraw-Hill, 1958.


Feather, N. T. Test-retest reliability of individual values and value
systems.  Australian  Psychologist.  1971,  6(3),  181-188. 


Feder,  D.  D.  Effect  of  direction  and  arrangement  of  items  on  students' 
performance in a test. Journal of Educational Research. 1936, 28-45.


Federico,  P.  Development  of  psychometric  measures  of  student  attitudes 
toward  technical  training:  Reliability  and  factorial  validity.  Brooks  AFB, 
Texas:  Air  Force  Human  Resources  Laboratory,  1970.  Report  No.  AFHRL-TR-70-37 

Attitude Measures, Military Personnel, Reliability,  2
Scaling, Validity

DDC  A-M


Federico,  P.  Degree  of  evaluation  assertions  ascribed  to  an  attitude 
universe as a function of measurement format (AD 736 788). Psychological
Reports, 1971 (a), 1315-1324.


Attitude Measures, Scaling  2

DDC  A-H


Federico, P. Identifying item validity indices utilizing a multivariate
model. Lowry AFB, Colorado: Technical Training Division, Air Force Human
Resources Laboratory, 1971 (b). AFHRL-TR-71-16.


N/A 

N/A 


N/A 

T-M 




Fehrer, E. Shifts in scale values of attitude statements as a function
of the composition of the scale. Journal of Experimental Psychology. 1952,
44, 179-188.


* Fehrer,  E.,  & Strupp,  H.  The  effect 

prestige value. Journal of Applied Psychology, 1949, 33, 222-230.


Scaling, Interest Measures, Response Bias  8, 12

Psych. Abst., 24, #1923  A-N


Feldman, J. J., Hyman, H., & Hart, C. W. A field study of interviewer
effects on the quality of survey data. Public Opinion Quarterly, 1951,
734-761.


Investigator  Error,  Interviews,  Response  Bias 
Psych. Abst., #3451 (Rev. from rept.)


* Feldman, M. J., & Corah, N. L. Social desirability and the forced choice
method. Journal of Consulting Psychology, 1960, 24, 480-482.

Response Bias, Forced Choice Items  2, 12

Psych. Abst., #1HF 80F


* Feldman, S. Evaluative ratings of adjective-adjective combinations, predicted
from ratings of their components. Dissertation Abstracts International,
1969, 30(2-B), 864.

Adjectives, Response Alternatives, Semantic Differential  3b, 3f
Items

Dissertation Abstracts International, 30, 864-B (rev.)  A-H


Ferber, R. Order bias in a mail survey. Journal of Marketing, 1952,
(2),  171-178. 




* Ferber,  R.  The  effect  of  respondent  ignorance  on  survey  results.  Journal 
of  the  American  Statistical  Association.  1956,^,  576-586. 

Investigator  Error  9 

Psych.  Abst.  , #372  A-M 


* Ferber,  R.  Item  nonresponse  in  a consumer  survey.  Public  Opinion  Quarterly, 
1966,  30,  399-415. 

Investigator  Error  3c,  9 

Potter, Sharpe, Hendee, and Clark, 1972  R-NA


Ferber,  R.,  & Hauck,  M.  A framework  for  dealing  with  response  errors  in 
consumer  surveys.  Marketing  Concept  in  Action,  1964,  533-540. 


* Ferber,  R.,  & Wales,  H.  G.  Detection  and  correction  of  interviewer  bias. 
Public  Opinion  Quarterly,  1952,  1^(1),  107-127. 

Investigator  Error,  Interviews  13 

Bureau  of  the  Census,  #7111030701  R-H 


Ferber,  R.,  & Wales,  H.  G.  Advertising  recall  in  relation  to  type  of 
recall.  Public  Opinion  Quarterly,  1958-59,  ^(4),  529-536. 


Ferber,  R.,  & Wales,  H.  A basic  bibliography  on  marketing  research. 
Chicago:  American  Marketing  Association,  1963. 


Ferguson,  H.  H.  Incentives  and  an  intelligence  test.  Australian  Journal 
of  Psychology,  1935,  39-53. 


* Ferguson,  L.  W.  The  influence  of  individual  attitudes  on  construction  of 
an  attitude  scale.  Journal  of  Social  Psychology,  1935  (a),  6,  115-117. 

Attitude Measures, Response Bias, Investigator  10, 12
Error, Scaling

ORA  R-M


Ferguson, L. W. Some problems in the measurement of attitudes. Unpublished
Master's thesis, Stanford University. 1935 (b).


N/A 


N/A 


N/A 


T-M 


Ferguson,  L.  W.  The  isolation  and  measurement  of  general  attitudes. 
Paper  presented  at  the  Philadelphia  meetings  of  the  Eastern  Psychological 
Association,  1939  (a). 


N/A 


N/A 


N/A 


T-M 


Ferguson, L. W. The requirements of an adequate attitude scale. Psychological
Bulletin, 1939 (b), 665-673.

Attitude Measures, Literature Review, Scaling  16, 14, 15

ORA  R-N


Ferguson, L. W. Comparison of scale values from the method of equal
appearing  intervals  and  paired  comparisons  method.  Journal  of  General 
Psychology , 1940,  431-435. 


Ferguson, L. W. A study of the Likert technique of attitude scale construction.
Journal of Social Psychology. 1941, 13, 51-57.


Ferguson,  L.  W.  A revision  of  the  primary  social  attitude  scale.  Journal 
of  Psychology.  1944,  1^,  229-241. 


Ferguson, L. W. Psychology and the Army: Introduction of the Rating
Scale. Heritage of Industrial Psychology. 1963, 9, 125-139.

Bibliography 

Psych.  Abst. , 38 , #6833  A-M 




Ferguson,  L.  W.,  Huguenard,  T.,  & Sager,  E.  B.  Interview  time,  interview 
set,  and  interview  outcome.  Perceptual  and  Motor  Skills.  1970,  31(3)  , 
831-836. 


Fernberger,  S.  W.  Instructions  and  the  psychophysical  limen.  American 
Journal  of  Psychology,  1931,  43,  93-100. 


Fernberger, S. W., Glass, E., Hoffman, I., & Willig, M. Judgment times of
different  psychophysical  categories.  Journal  of  Experimental  Psychology, 
1934,  17,  286-293. 


Fernberger, S. W., & Irwin, F. W. Time relations for the different categories
of judgment in the "absolute method" in psychophysics. American
Journal  of  Psychology,  1932,  505-525. 


Ferris,  A.  C.  A note  on  stimulating  response  to  questionnaires.  American 
Sociological  Review.  1951,  1^,  247-249. 

Respondents'  motivation  11 

ORA 


Festinger,  L.  Studies  in  decision:  1.  Decision  time,  relative  frequency 
of  judgment,  and  subjective  confidence  as  related  to  physical  stimulus 
difference.  Journal  of  Experimental  Psychology.  1943,  241-306. 


Festinger,  L.  The  treatment  of  qualitative  data  by  "scale  analysis." 
Psychological  Bulletin.  1947,  149-161. 


Festinger,  L.,  and  Katz,  D.  (Eds.)  Research  methods  in  the  behavioral 
sciences . New  York:  Dryden  Press,  1953. 


Field,  J.  B.  The  effects  of  praise  in  a public  opinion  poll.  Public 
Opinion  Quarterly,  1955,  1^(1),  85-91. 

Interviews,  Open-Ended  Items,  Respondent's  Motivation  11 

ORA  R-H 




Fillenbaum,  S.  The  effect  of  a remote  anchor  upon  judgment  with  a salient 
within-series  stimulus-object  and  with  a free  choice  of  scale.  American 
Journal of Psychology, 1961, 74, 602-606.


N/A 

N/A 

N/A 

T-H 

Finch, C. W., & Gibson, J. N. Development of a questionnaire to measure
Air Force junior officer's attitudes toward intrinsic aspects of the work
itself (AD 743 405). Wright-Patterson AFB: Air Force Institute of
Technology, School of Systems and Logistics, 1971. Report No. SLSR-12-72A.


N/A 

N/A 

N/A 

T-M 

* Findikyan,  N.  Acquiescence  and  bipolarity  in  personality  questionnaires. 
Dissertation  Abstracts,  1969,  ^(8-B)  , 3103-3104. 

Response  Bias,  Personality  Measures,  Rating  Scales  2,  12 

Dissertation  Abstracts,  ^(8-B),  p.  3103  (Rev.  from  rept.)  A-N 


Fink,  H.  C.  Fictitious  groups  and  the  generality  of  prejudice:  An  artifact 
of  scales  without  neutral  categories.  Psychological  Reports,  1971, 
^(2),  359-365, 


Finn,  R.  H.  Effects  of  some  variations  in  rating  scale  characteristics  on 
the  means  and  reliabilities  of  ratings.  Educational  and  Psychological 
Measurement,  1972,  32(2),  255-265. 

Response  Alternatives  3a,  3f 

Psych.  Abst.,  49,  #8213  A-H 


* Fischer,  R.  P.  Signed  versus  unsigned  personal  questionnaires.  Journal 
of  Applied  Psychology,  1946,  30(3),  220-225. 

Anonymous  Respondent  11 

Potter,  Sharpe,  Hendee,  and  Clark,  1972  A-M 




Fishbein,  M.  The  relationships  between  beliefs,  attitudes  and  behavior. 
In  Feldman,  S.  (Ed.),  Cognitive  consistency.  New  York:  Academic  Press, 
1966.  Pp.  199-223. 


Fishbein,  M.  Attitude  and  the  prediction  of  behavior.  In  Fishbein,  J. 
(Ed.),  Readings  in  attitude  theory  and  measurement.  New  York:  John  Wiley, 
1967. 

4 

Fishbein,  M. , & Ajzen,  I.  Attitudes  and  opinions.  Annual  Review  of 
Psychology,  1972,  487-544. 

Attitude  Measures,  Literature  Review  16 


Fishburn,  P.  C.  Semiorders  and  risky  choices.  Journal  of  Mathematical 
Psychology , 1968,  358-361. 


Fisher,  G.  A  discriminant  analysis  of  reporting  errors  in  health  inter- 
views.  Applied  Statistics,  1962,  11(3),  148-163. 


Fisher,  H.  Interview  bias  in  the  recording  operation.  International 
Journal  of  Opinion  and  Attitude  Research.  1950,  4,  391-411. 


Fiske,  D.  W.  Consistency  of  the  factorial  structures  in  personality 
ratings  from  different  sources.  (Ph.D.  thesis).  University  of  Michigan, 
1948. 


Fiske,  D.  W.  Subject  reactions  to  inventory  format  and  content.  Pro- 
ceedings  of  the  77th  Annual  Convention  of  the  American  Psychological 
Association,  1969,  4,  137-138. 

Military  Personnel,  Instrument  Format,  Investigator  Error  3g 

ORA 




Fiske,  D.  W.,  and  Cox,  J.  A.,  Jr.  The  consistency  of  ratings  by  peers. 
Journal  of  Applied  Psychology.  1960,  11-17. 


Fiske,  D.  W.,  &  Pearson,  P.  H.  Theory  and  techniques  of  personality  measure- 
ment.  Annual  Review  of  Psychology,  1970,  49-86. 


* Flanagan,  J.  C.  The  critical  incident  technique.  Psychological  Bulletin, 
1954,  51,  327-358. 

Critical  Incident  Technique  3g,  17 


* Flanagan,  J.  C.  The  development  of  an  index  of  examinee  motivation. 
Educational  and  Psychological  Measurement,  1955,  15,  144-151. 

Response  Bias,  Military  Personnel 

Educational  and  Psychological  Measurement,  15,  p.  150  (Rev.)  R-H 


Fleischman,  H.  L.  Consumer  acceptance  of  programmed  instruction.  St. 
Louis,  Mo.:  Washington  University,  Department  of  Psychology,  1968.  Technical 
Report  No.  15  60-65. 


Fleiss,  J.  L.  Analysis  of  variance  methods  in  assessing  errors  in  inter- 
view data.  Dissertation  Abstracts.  1968,  ^(12-B),  5231. 


Fleiss,  J.  L.  Estimating  the  reliability  of  interview  data.  Psychometrika, 
1970,  35(2),  143-162. 


Fleiss,  J.  L.  Measuring  nominal  scale  agreement  among  many  raters. 
Psychological  Bulletin,  1971,  378-382. 

N/A 

N/A  A-H 




Fletcher,  N.  C.,  &  Shephard,  A.  H.  Interpretation  of  data  as  a  function 
of  units  of  measurement.  Canadian  Journal  of  Psychology,  1957,  65-70. 


N/A  18 

Psych.  Abst.,  32,  #3523  A-N 


* Flyer,  E.  S.,  & Carp,  F.  M.  The  Picture  Test:  Rationale  and  one  validation 
of  the  method.  Journal  of  Applied  Psychology . 1962,  226-227. 

Response  Bias,  Rating  Scales,  Clarity  12,  2,  4 

Psych.  Abst.,  37,  #1186  A-M 


Fogel,  L.  J.  Biotechnology:  Concepts  and  applications.  Englewood  Cliffs: 
Prentice-Hall,  1963. 


* Follman,  J.,  Urbanke,  R.,  & Burley,  W.  Comparison  of  three  matching  item 
formats.  Florida  Journal  of  Educational  Research.  1971,  1^,  39-48. 

Matching  Items,  Instrument  Format,  Response  Alternatives  3b,  3e 

ORA  R-M 


Ford,  LeR.  H.,  Jr.  A forced-choice,  acquiescence-free,  social  desirability 
(defensiveness)  scale.  Journal  of  Consulting  Psychology.  1964,  28(5),  475. 


Ford,  LeR.  H.,  Jr.,  & Meisels,  M.  Social  desirability  and  the  semantic 
differential.  Educational  and  Psychological  Measurement,  1965,  25(2), 
465-475. 


Ford,  N.  M.  The  advance  letter  in  mail  surveys.  Journal  of  Marketing 
Research,  1967,  4,  202. 

Respondent's  Motivation  11 

ORA  R-M 



Ford,  N.  M.  Questionnaire  appearance  and  response  rates  in  mail  surveys. 
Journal  of  Advertising  Research,  1968,  8,  43-45. 


Respondent's  Motivation,  Instrument  Format  3g,  11 

ORA  R-M 


Ford,  N.  M.  Consistency  of  responses  in  a mail  survey.  Journal  of 
Advertising  Research,  1969,  9(4),  31-33. 

Reliability,  Interviews,  Close-Ended  Items  1 

ORA  R-M 


Ford,  R.  N.  Scaling  experience  by  a multiple-response  technique:  A study 
of  white-Negro  contacts.  American  Sociological  Review,  1941,  6,  9-23. 


Ford,  R.  N.  A rapid  scoring  procedure  for  scaling  attitude  questions. 
Public  Opinion  Quarterly,  1950,  507-532. 

Attitude  Measures,  Scaling,  Data  Analysis  15,  14,  8 

Psych.  Abst.  , #2106  R-N 


Forthman,  J.  H.  The  effects  of  a zero  interval  on  semantic  differential 
rotated  factor  loadings.  Journal  of  Psychology,  1973,  ^(1),  23-32. 


Foster,  R.  T.  Acquiescent  response  set  as  a measure  of  acquiescence. 
Journal  of  Abnormal  and  Social  Psychology,  1961,  155-160. 

Response  Bias,  Personality  Measures  12 

Psych.  Abst.  , 36 , #4HF55F  A-M 


Foster,  R.  J.,  &  Grigg,  A.  E.  Acquiescent  response  set  as  a  measure  of 
acquiescence:  Further  evidence.  Journal  of  Abnormal  and  Social  Psychology. 
1963,  ^(3),  304-306. 

Response  Bias  12 

Psych.  Abst.,  38,  #2503  A-N 


Fowler,  H.  M.  An  application  of  the  Ferguson  method  of  computing  item 
conformity  and  person  conformity.  Journal  of  Experimental  Education. 
1954,  22,  237-245. 


Frank,  B.  Stability  of  questionnaire  response.  Journal  of  Abnormal  and 
Social  Psychology,  1935,  320-324. 

Investigator  Error,  Respondent's  Motivation  7,  11 

Potter,  Sharpe,  Hendee  and  Clark,  1972  A-N 


Frankel,  L.  R.  How  incentives  and  subsamples  affect  the  precision  of  mail 
surveys.  Journal  of  Advertising  Research,  1960,  1(1),  1-5. 


Franken,  R.  B.  Formulating  questionnaires.  Advertising  Fortnightly.  1924 
(a),  2,  23. 


Franken,  R.  B.  Formulating  questionnaires.  Advertising  Fortnightly.  1924 
(b),  2,  27-28. 


Franzen,  R.  The  construction  of  a  questionnaire.  Market  Research,  1936, 
4(5),  17-19. 


Franzen,  R.  H.,  & Lazarsfeld,  P.  F.  Mail  questionnaire  as  a research  prob- 
lem. Journal  of  Psychology.  1945,  M,  293-320. 

* Frawley,  P.  J.  A  study  of  judgment:  A  factorial  analysis  of  the  anchoring 
effects.  Washington,  D.  C.:  Catholic  University  of  America  Press,  1948. 

Rating  Scales,  Data  Analysis  3f,  8 

ORA  R-N 


* Frederiksen,  N.,  & Messick,  S.  Response  set  as  a measure  of  personality. 
Office  of  Naval  Research  Technical  Report.  Princeton,  New  Jersey:  Education- 
al  Testing  Service,  1958. 

Military  Personnel,  Response  Bias,  Investigator  Error  12,  7 

Psych.  Abst.,  34,  #4065  A-M 


* Freeberg,  N.  E.  Relevance  of  rater-ratee  acquaintance  in  the  validity 
and  reliability  of  ratings.  Journal  of  Applied  Psychology,  1969, 
518-524. 

Investigator  Error,  Observations,  Raters,  Validity 

Journal  of  Applied  Psychology,  p.  518 


Freeman,  L.  C.,  &  Ataöv,  T.  Invalidity  of  indirect  and  direct  measures  of 
attitude  toward  cheating.  Journal  of  Personality,  1960,  28(4),  443-447. 

Question  Stem  3g 

Potter,  Sharpe,  Hendee,  and  Clark,  1972  A-M 


Freidman,  M.  P.,  &  Fleishman,  E.  A.  A  note  on  the  use  of  a  "don't  know" 
alternative  in  multiple  choice  tests.  Journal  of  Educational  Psychology, 
1956,  47,  344-349. 

Achievement  Measures,  Response  Alternatives,  Reliability  3g 

Psych.  Abst.,  32,  #4598 


* French,  E.  G.  A  note  on  the  Edwards  Preference  Schedule  for  use  with 
basic  airmen.  Educational  and  Psychological  Measurement,  1958,  18,  109-115. 

Investigator  Error,  Response  Bias,  Personality  Measures,  Military  Personnel  12,  13,  9 

Psych.  Abst.,  33,  #8340  A-M 


French,  J.  W.  A  note  on  keying  and  item-selection  for  a  practical  judg- 
ment  test.  Psychometrika,  1952,  12,  101-6. 


French,  J.  W.  The  effect  of  essay  test  on  student  motivation.  Research 
Bulletin  56-4.  Princeton,  New  Jersey:  Educational  Testing  Service,  1956. 


N/A 


N/A 


N/A 


T-M 




French,  J.  W.  The  kinds  of  items  that  work  in  an  interest  activities 
index.  Research  Bulletin  64-36.  Princeton,  New  Jersey:  Educational  Test- 
ing  Service,  1964. 

N/A  N/A 

N/A  T-M 


Freyd,  M.  An  appraisal  of  relative  merits  of  types  of  rating  scales  and 
their  use.  General  Management  Series  No.  38,  New  York:  American  Management 
Association,  1926. 

N/A  N/A 

N/A  T-H 


Fricke,  B.  G.  Response  set  as  a  suppressor  variable  in  the  OAIS  and  MMPI. 
Journal  of  Consulting  Psychology.  1956,  20,  161-169. 


Fricke,  B.  G.  Subtle  and  obvious  test  items  and  response  set.  Journal  of 
Consulting  Psychology,  1957,  250-252. 

Response  Bias,  Personality  Measures,  Question  Stem  12 

Journal  of  Consulting  Psychology,  21,  p.  252  R-M 


Friesen,  E.  P.  The  incomplete  sentences  technique  as  a  measure  of  employee 
attitudes.  Personnel  Journal,  1952,  329-345. 


Frisbie,  D.  A.,  &  Ebel,  R.  L.  Comparative  reliabilities  and  validities  of 
true-false  and  multiple  choice  tests  (ERIC  Document  Reproduction  Service, 
ED  064  388).  Paper  presented  at  the  Annual  Meeting  of  the  American  Educa- 
tional  Research  Association,  Chicago,  1972. 

Achievement  Measures  18 

ERIC  Document  Reproduction  Service,  ED  064  388  A-N 


* Frisbie,  B.,  &  Sudman,  S.  The  use  of  computers  in  coding  free  responses. 
Public  Opinion  Quarterly,  1968,  216-232. 

Check  List,  Open-Ended  Items,  Investigator  Error,  Scoring  2,  8,  10,  13 

ORA  R-H 


Fromme,  A.  On  the  use  of  certain  qualitative  methods  of  attitude  research. 
Journal  of  Social  Psychology,  1941,  1^,  425-459. 


Fry,  E.  A  readability  formula  that  saves  time.  Journal  of  Reading,  1968, 
11(7),  575-578. 

Clarity  3g 

ORA  A-M 


Fry,  J.  N.,  & Claxton,  J.  D.  Semantic  differential  and  nonmetric  multi- 
dimensional scaling  descriptions  of  brand  images.  Journal  of  Marketing 
Research.  1971,  8,  238-240. 

Semantic  Differential  Items,  Data  Analysis,  Card  Sorts  2 

ORA  R-M 


Fulkerson,  S.  C.  Individual  differences  in  response  validity.  Journal 
of  Clinical  Psychology,  1959,  1^,  169-173. 


Fuller,  C.  H.  Effect  of  anonymity  on  return  rate  and  response  bias  in  a 
mail  survey  (AD-746  478).  Washington,  D.  C.:  Naval  Personnel  Research 
and  Development  Laboratories,  1972.  Report  No.  WTR-73-2. 


N/A 

N/A 

N/A 

T-M 

Fullerton,  G.  S.,  &  Cattell,  J.  McK.  On  the  perception  of  small  differ- 
ences.  Publ.  Univ.  Penn.,  Phil.  Series,  1892,  No.  2. 




Furntratt,  E.  Response  tendencies  in  questionnaires:  I.  The  tendency  to 
agree  and  the  tendency  of  using  the  full  response  scale.  Psychologische 
Rundschau , 1969,  ^(1),  1-18. 


Furst,  E.  J.  Constructing  evaluation  instruments.  New  York:  David  McKay, 
1958. 


Gadel,  M.  S.  The  relationship  of  item  validity  shrinkage  to  curvilinearity 
of  response  distributions.  Educational  and  Psychological  Measurement,  1958, 
145-152. 


* Gage,  N.  L.,  & Chatterjee,  B.  B.  The  psychological  meaning  of  acquiescence 
set:  Further  evidence.  Journal  of  Abnormal  and  Social  Psychology,  1960,  60, 
280-283. 

Response  Bias,  Investigator  Error  12,  3g 

Psych.  Abst. , 34,  #7548  A-M 


Gage,  N.  L.,  Leavitt,  G.  S.,  & Stone,  G.  C.  The  psychological  meaning  of 
acquiescence  set  for  authoritarianism.  Journal  of  Abnormal  and  Social 
Psychology,  1957,  98-103. 


* Gaito,  J.  Forced  and  free  Q  sorts.  Psychological  Reports,  1962,  10(1), 
251-254. 

Card  Sorts,  Forced  Choice  Items,  Respondent's  Motivation  2,  11 

Psych.  Abst.,  37,  #1187  A-M 


Gale,  E.  J.,  Jr.  Nonresponse  in  a mail  survey  of  naval  personnel  (AD  733 
377).  Monterey,  California:  Naval  Postgraduate  School,  1971. 


N/A 

N/A 


N/A 

T-M 




Gales,  K.,  &  Kendall,  M.  G.  An  inquiry  concerning  interviewer  variability. 
Journal  of  the  Royal  Statistical  Society,  1957,  Series  A,  (Part  II), 
121-147. 


Gallup,  G.  Question  wording  in  public  opinion  polls:  Comments  on  points 
raised  by  Mr.  Stegner.  Sociometry,  1941,  4,  259-268. 


Gallup,  G.  A  guide  to  public  opinion  polls.  Princeton:  Princeton  Univer- 
sity,  1944. 


Gallup,  G.  Qualitative  measurement  of  public  opinion:  The  quintamensional 
plan  of  question  design.  Princeton,  N.  J.:  American  Institute  of  Public 
Opinion,  1947  (a). 


N/A 

N/A 

N/A 

T-H 

Gallup,  G.  The  quintamensional  plan  of  question  design.  Public  Opinion 
Quarterly,  1947  (b),  385-393. 

Investigator  Error  13 

Bureau  of  Census,  #7110008301  R-N 


Gallup,  G.  Public  opinion  surveys.  New  York,  1950. 


Gallup,  G.,  &  Rae,  S.  F.  The  pulse  of  democracy.  New  York:  Simon  &  Schuster, 
1940. 


Gannon,  M.  J.  Proper  use  of  the  questionnaire  survey.  Business  Horizons, 
1973,  16,  89-94. 

Q 

Data  Analysis 




Garcia-Mata,  C.  A practical  method  of  smoothing  statistical  curves  by 
hand.  Bull.  Pan  Amer.  Union,  1941,  75,  276-281. 


* Gardner,  E.  F.  Comments  on  selected  scaling  techniques  with  a description 
of  a new  type  of  scale.  Journal  of  Clinical  Psychology.  1950,  38-43. 

Scaling  14 

Psych.  Abst. , , #63  A-M 


Gardner,  P.  L.  Test  length  and  the  standard  error  of  measurement.  Journal 
of  Educational  Measurement.  1970,  7,  271-274. 


Gardner,  R.  A.  Multiple-choice  decision-behavior.  American  Journal  of 
Psychology , 1958,  710-717. 

Military  Personnel,  Multiple  Choice  Items,  Response  Alternatives  3a 

Psych.  Abst.,  34,  #801 


Garner,  W.  R.  Rating  scales,  discriminability,  and  information  transmission. 
Psychological  Review,  1960,  67,  343-352. 


Garner,  W.  R.,  & Hake,  H.  W.  The  amount  of  information  in  absolute  judg- 
ments. Psychological  Review,  1951,  446-459. 

Questionnaire  Theory  and  Development  15 


Garry,  R.  Individual  differences  in  ability  to  fake  vocational  interests. 
Journal  of  Applied  Psychology,  1953,  33-37. 


Garvin,  A.  D.  Non-chance  results  from  a pure-chance  test:  A study  in  re- 
sponse position  selection  set  (ERIC  Document  Reproduction  Service,  ED  058 
284).  Paper  presented  at  the  Annual  Meeting  of  the  American  Psychological 
Association,  Washington,  D.  C.,  1971. 


Garvin,  A.  D.  Confidence  weighting.  (ED  062  401,  MF  and  HC  available  from 
EDRS.) 


Garvin,  A.  D.  Confidence  weighting  plus  Coombs-type  response  options: 
A  good  idea  that  failed.  (ED  065  551,  MF  and  HC  available  from  EDRS.) 


Gatty,  R.,  &  Allais,  C.  The  semantic  differential  applied  to  image  research. 
New  Brunswick:  Rutgers  University  (undated). 


Gekoski,  N.  Psychological  testing:  Theory,  interpretation,  and  practices. 
Springfield,  111.:  Charles  C.  Thomas,  1969. 


Gekoski,  N.,  & Isard,  E.  S.  Note  on  another  use  of  the  sentence-completion 
technique.  Journal  of  Applied  Psychology,  1955,  39,  139. 


Gelfand,  N.  1.  An  investigation  of  interviewer  recall  of  interviewee  re- 
sponses. Dissertation  Abstracts  International . 1973,  ^ (10-B),  5043-5044. 


Gentile,  J.  R.,  & Seibel,  R.  A rating  scale  measure  of  word  relatedness. 
Journal  of  Verbal  Learning  and  Verbal  Behavior,  1969,  8(2),  252-256. 

Data  Analysis  15 

ORA  R-N 


Georgoff,  D.  M.,  Hersker,  H.  J.,  & Murdick,  R.  G.  The  lost-letter 
technique  --  a scaling  experiment.  Public  Opinion  Quarterly.  1972,  36(1), 
114-119. 

Attitude  Measures,  Investigator  Error  10 

ORA  R-N 


Gerberich,  J.  B.  A study  of  the  consistency  of  informant  responses  to 
questions  in  a questionnaire.  Journal  of  Educational  Psychology,  1947, 
38,  299-306. 


* Gerberich,  J.  B.,  & Mason,  J.  M.  Signed  versus  unsigned  questionnaire. 
Journal  of  Education  Research,  1948,  ^ (2),  122-126. 


Anonymous  Respondent  11 

Potter,  Sharpe,  Hendee,  and  Clark,  1972  A-H 


Getzels,  J.  W.,  & Walsh,  J.  J.  The  method  of  paired  direct  and  projective 
questionnaires  in  the  study  of  attitude  structure  and  socialization.  Psycho- 
logical  Monographs,  1958,  72(1)  (Whole  No.  454). 


Ghiselli,  E.  E.  Some  further  points  on  public  opinion  polls.  Journal  of 
Marketing,  1940,  5,  115-119. 

Question  Stem,  Clarity,  Investigator  Error  4,  13 

ORA  R-N 


Ghiselli,  E.  E.  The  problem  of  question  form  in  the  measure  of  sales  by 
consumer  interviews.  The  Journal  of  Marketing,  1941,  170-171. 

Question  Stem,  Interviews,  Clarity  4,  3g 

ORA  R-N 


* Ghiselli,  E.  E.  All  or  none  versus  graded  response  questionnaires.  Journal 
of  Applied  Psychology,  1939,  23,  405-413. 

Multiple  Choice  Items,  Response  Alternatives  2,  3a 

ORA  R-M 


Ghiselli,  E.  E.  The  measurement  of  occupational  aptitude.  Berkeley: 
University  of  California  Press,  1955. 


Ghiselli,  E.  E.  Theory  of  psychological  measurement.  New  York:  McGraw- 
Hill,  1964. 


* Ghiselli,  E.  E.,  &  Brown,  C.  W.  Personnel  and  industrial  psychology.  New 
York:  McGraw-Hill,  1948. 

Textbook 

ORA 


Gibbins,  K.  Response  sets  and  the  semantic  differential.  British  Journal 
of  Social  and  Clinical  Psychology,  1968,  7(4),  253-263. 


* Gibson,  F.  K.,  &  Hawkins,  B.  W.  Interviews  versus  questionnaires. 
American  Behavioral  Scientist,  1968,  1^,  9-16. 


Interviews  1 

ORA  R-M 


Gilbert,  A.  R.  Superiority  of  latency-weighted  scores  over  weighted  scores 
in  the  assessment  of  professional  interests.  Psychological  Reports,  1970, 
^(1),  93-94. 


* Gilbert,  T.  F.  Experiments  in  morale.  Journal  of  Social  Psychology.  1956, 
43,  299-308. 

Interest  Measures,  Military  Personnel,  Response  Bias  9,  10,  12 

Psych.  Abst . , 33 , #4801  A-H 


Ginter,  J.  L.  An  experimental  investigation  of  attitude  change  and  choice 
of  a  new  brand.  Journal  of  Marketing  Research,  1974,  11,  30-40. 


* Githens,  W.  H.  Influence  of  questionnaire  items  on  general  attitude  toward 
job.  USN  PRA  Research  Memorandum  No.  65-13. 

Rating  Scales,  Attitude  Measures,  Military  Personnel,  Question  Stem  3g 

Psych.  Abst.,  39,  #16563  A-H 




Gividen,  G.,  Nystrom,  C.  O.,  &  Arsdell,  P.  M.,  Jr.  Fort  Hood  semiannual 
project  VOLAR/MVA  evaluation  report.  (Preliminary  report.)  Office  of  the 
Deputy  Chief  of  Staff  for  Command  Programs,  III  Corps  and  Fort  Hood,  Fort 
Hood,  Texas,  1972. 


Glaser,  R.  A methodological  analysis  of  the  inconsistency  of  response  to 
test  items.  Educational  and  Psychological  Measurement,  1949,  9,  727-740. 


Glasser,  R.  Training  research  and  education.  Pittsburgh:  University  of 
Pittsburgh  Press,  1962. 


Gleser,  L.  J.  On  bounds  for  the  average  correlation  between  subtest  scores 
in  ipsative  scoring.  Educational  and  Psychological  Measurement.  1972,  32(3), 
759-765. 


Glickman,  A.  S.  Studies  in  career  motivation:  III.  Effects  of  repeated 
questionnaire  administration  on  returns  and  on  intended  and  actual  reenlist- 
ment (AD  633  598).  Washington,  D.  C.:  Bureau  of  Naval  Personnel,  Personnel 
Research  Division,  1962.  Report  No.  RS-62-5. 

Military  Personnel,  Respondent's  Motivation  11 

DDC,  #AD  633  598  A-M 


Gloye,  E.  E.  A note  on  the  distinction  between  social  desirability  and 
acquiescent  response  styles  as  sources  of  variance  in  the  MMPI.  Journal 
of  Counseling  Psychology,  1964,  11(2),  180-184. 

Response  Bias,  Question  Stem,  Validity,  True-False  Items  12 

Psych.  Abst.,  39,  #5175  A-N 


Goheen,  H.  W.,  &  Kavruck,  S.  Selected  references  on  test  construction, 
mental  test  theory  and  statistics.  Washington,  D.  C.:  U.S.  Civil  Service 
Commission,  1950. 


Goheen,  H.  W.,  and  Mosel,  J.  N.  Validity  of  the  employment  recommendation 
questionnaires:  II.  Comparison  with  field  investigations.  Personnel 
Psychology , 1959,  J^,  297-301. 


Goldberg,  L.  R.  Objective  diagnostic  tests  and  measures.  Annual  Review 
of  Psychology,  1974,  25,  343-366. 


Goldberg,  L.  R.,  & Hase,  H.  D.  Comparative  validity  of  different  strategies 
of  constructing  personality  inventory  scales.  Psychological  Bulletin,  1967, 
^(4),  231-248, 


Golden,  G.  M. , & others.  Item  writing  rule  conformity  as  related  to  bio- 
graphical item  response  stability.  Paper  presented  in  a session  on  Testing 
Materials  at  the  Convention  of  the  Rocky  Mountain  Psychological  Association, 
Denver,  Colorado,  May  1971. 


Goldfried,  M.  R.,  &  McKenzie,  J.  D.,  Jr.  Sex  difference  in  the  effect  of 
item  style  on  social  desirability  and  frequency  of  endorsement.  Journal  of 
Consulting  Psychology,  1962,  ^(2),  126-128. 


* Goldsamt,  M.  R.  Effects  of  scoring  method  and  rating  scale  length  in 
extreme  response  style  measurement.  Dissertation  Abstracts  International. 
1972,  ^(10-B),  6030. 

Scoring,  Rating  Scales,  Response  Alternatives,  Response  Bias  3a,  8,  9,  12 

ORA  R-NA 


Goldstein,  I.  L.  The  application  blank:  How  honest  are  the  responses? 
Journal  of  Applied  Psychology,  1971,  55(5),  491-492. 


Goldstein,  M.  J.  The  social  desirability  variable  in  attitude  research. 
Journal  of  Social  Psychology,  1960,  103-108. 




Goode,  P.  V.  How  to  get  better  results  from  attitude  surveys.  Personnel 
Journal,  1973,  52  (3),  187-192. 


Goode,  W.  J.,  &  Hatt,  P.  Methods  in  social  research.  New  York:  McGraw- 
Hill,  1952. 


Goodenough,  W.  A.  A technique  for  scale  analysis.  Educational  and  Psycho- 
logical Measurement,  1944,  4,  179-190. 


Goodnow,  J.  J.  Response  sequences  in  a pair  of  two-choice  probability 
situations.  American  Journal  of  Psychology,  1955,  624-630. 


Goodstein,  L.  D.,  & Heilbrun,  A.  B.  Social  desirability  response  set: 
Error  or  predictor  variable.  Journal  of  Psychology,  1961,  321-9. 


Gordon,  R.  L.  Interviewing:  Strategy,  techniques,  and  tactics.  Homewood, 
111.:  Dorsey,  1969. 


* Gordon,  L.  V.  Validities  of  the  forced-choice  and  questionnaire  methods 
of  personality  measurement.  Journal  of  Applied  Psychology,  1951,  35,  407- 
412. 

Forced  Choice  Items,  Personality  Measures,  Validity,  Closed-Ended  Items  2,  8,  14 

Psych.  Abst. , '2J_,  #424  A-M 


Gordon,  L.  V.  Validity  of  scoring  methods  for  bipolar  scales.  Education- 
al and  Psychological  Measurement,  1967,  27(4,  Pt.  2),  1099-1106. 


Gordon,  L.  V.  Are  there  two  extremeness  response  sets?  Educational 
and  Psychological  Measurement,  1971,  31(4),  867-873. 




* Gordon,  M.  E.  An  examination  of  the  relationship  between  the  accuracy 
and  favorability  of  ratings.  Journal  of  Applied  Psychology,  1972,  56(1), 
49-53. 

Investigator  Error 

Journal  of  Applied  Psychology,  56,  p.  49  (Rev.  from  rept.)  R-M 


Gorfein,  D.  S.  Scaling  theory  and  group  influence:  A re-examination. 
Journal  of  Social  Psychology,  1964,  ^(2),  303-308. 

Scaling,  Attitude  Measures  3f,  10 

Psych.  Abst. , 2^,  #4862  A-M 


Gotkin,  L.  G.,  & Goldstein,  L.  S.  Descriptive  statistics,  a programmed 
textbook.  New  York:  John  Wiley,  1964. 


Gough,  H.  G.  The  adjective  check  list  as  a personality  assessment  research 
technique.  Psychological  Reports,  1960,  6,  107-122. 

Personality  Measures  18 

Psych.  Abst. , 34 , #7386  A-N 


Gough,  H.  G.,  & Heilbrun,  A.  B.  Adjective  Check-list  Manual.  Palo  Alto: 
Consulting  Psychologists  Press,  1965. 

Personality  Measures  18 


ORA 


T-H 


Gough,  H.  G.,  McKee,  M.  G.,  & Yandell,  R.  J.  Adjective  checklist  analyses 
of  a  number  of  selected  psychometric  and  assessment  variables.  Maxwell  AFB, 
Alabama:  Officer  Education  Research  Laboratory,  1955.  Technical  Memorandum 
OERL-TM-10. 

N/A 

N/A  I'l* 




Green,  P.  E.,  &  Carmone,  F.  J.  Multidimensional  scaling  and  related 
techniques  in  marketing  analysis.  Boston:  Allyn  and  Bacon,  1970. 


Green,  P.  E.,  Maheshwari,  A.,  &  Rao,  V.  R.  Self-concept  and  brand  prefer- 
ence:  An  empirical  application  of  multidimensional  scaling.  Journal  of 
Marketing  Research,  1969,  11,  343-360. 


* Green,  P.  E.,  & Rao,  V.  A.  Rating  scales  and  information  recovery  - How 
many  scales  and  response  categories  to  use?  Journal  of  Marketing,  1970,  34, 
33-39. 


Rating  Scales,  Response  Alternatives  3a 

ORA  R-M 




Green,  P.,  &  Tull,  D.  Research  for  Marketing  Decisions.  New  York: 
Prentice-Hall,  1970. 


Green,  P.  E.,  Wind,  Y.,  &  Jain,  A.  K.  Analyzing  free-response  data  in 
marketing  research.  Journal  of  Marketing  Research,  1973,  9,  45-52. 




Green,  R.  F.  Does  a  selection  situation  induce  testees  to  bias  their 
answers  on  interest  and  temperament  measures?  Educational  and  Psychological 
Measurement,  1951,  11,  503-515. 

Preference  Measures,  Response  Bias,  Investigator  Error  12,  7 

ORA  R-M 


Green,  R.  F.,  & Goldfried,  M.  R.  On  the  bipolarity  of  semantic  space. 
Psychological  Monographs:  General  and  Applied,  1965,  79(6,  Whole  No.  599). 

Semantic  Differential  Items  Ig 

Psych.  Abst . , 39,  #9441  a_« 


Green,  R.  T.,  & Stacey,  B.  G.  The  response  style  myth:  An  empirical  study 
involving  the  T-scale.  Acta  Psychologica,  Amsterdam,  1966,  25(4),  365-372. 


Greenberg,  A.  L.  Respondent  ego-involvement  in  large  scale  surveys. 
Journal  of  Marketing.  1956,  390-393. 

Interviews,  Respondent's  Motivation,  Projective  Items,  Open-Ended  Items, 
X-O  Test  Items  1,  2,  11 


Greenberg,  A.  Validity  of  a  brand  awareness  question.  Journal  of  Market- 
ing,  1958,  n,  182-184. 




Greenberg,  A.  Pictorial  stereotypes  in  a projective  test.  Journal  of 
Marketing,  1959,  72-74. 


Projective  Items  15 

ORA  R-N 


Greenberg,  A.  Paired  Comparisons  vs.  monadic  tests.  Journal  of  Advertising 
Research,  1963,  3(4),  44-47. 

Paired  Comparison  Items,  Rating  Scales,  Preference  Measures  2,  3c 

ORA  R-H 


Greenberg,  M.  G.  A modification  of  Thurstone's  Law  of  Comparative  Judgment 
to  accommodate  a judgment  category  of  'equal'  or  'no  difference'.  Psycho- 
logical Bulletin,  1964,  108-12. 

Response  Alternatives  3g 


Psychological  Bulletin,  64,  p.  108  R-NA 


Greenwald,  H.  J.,  &  O'Connell,  S.  M.  Comparison  of  dichotomous  and  Likert 
formats.  Psychological  Reports.  1970,  ^(2),  481-482. 

True-False  Items,  Rating  Scales,  Reliability  2 

ORA  R-H 


Greenwood,  J.  A.  A preferential  matching  problem.  Psychometrika , 1943, 
8,  185-191. 


Gregson,  R.  A.  Representation  of  taste  mixture  cross-modal  matching  in  a 
Minkowski  r-metric.  Australian  Journal  of  Psychology,  1965,  2Z(3) , 195-204. 

Paired  Comparison  Items,  Clarity,  Instrument  Format,  Matching  Items, 
Data  Analysis  4,  2,  3g,  8 

Psych.  Abst.,  #4734  A-M 


Grice,  H.  H.  The  construction  and  validation  of  a generalized  scale  to 
measure  attitude  toward  defined  groups.  Bulletin  Purdue  University.  1934, 
25,  37-46. 


Gridgeman,  N.  T.  Significance  and  adjustment  in  paired  comparisons. 
Biometrics,  1963,  19(2),  213-228. 


Gritten,  F.,  & Johnson,  D.  M.  Individual  differences  in  judging  multiple- 
choice  questions.  Journal  of  Educational  Psychology.  1941,  32,  423-430. 


* Gross,  E.  J.  The  effect  of  question  sequence  on  measures  of  buying  interest. 
Journal  of  Advertising  Research,  1964,  4,  41. 


Instrument  Format,  Preference  Measures,  Investigator  Error  3c 

ORA  R-H 


* Guber,  J.  F.,  &  Gerberich,  J.  B.  A  note  on  consistency  in  questionnaire 
responses.  American  Sociological  Review,  1946,  11(1),  13-15. 


Investigator  Error  13 

Bureau  of  Census,  #7110000601  (Rev.)  A-H 


Guest,  L.  A study  of  interviewer  competence.  International  Journal  of 
Opinion  and  Attitude  Research.  1947,  1(4),  17-30. 


Guest,  L.  A  comparison  of  two-choice  and  four-choice  questions.  Journal 
of  Advertising  Research,  1962  (a),  2,  32-34. 


Instrument  Format,  Multiple  Choice  Items,  Response  Alternatives  3a,  2 

ORA  R-H 


Guest,  L.  Consumer  analysis.  Annual  Review  of  Psychology,  1962  (b),  13, 
315-344. 


Guest,  L.,  &  Nuckols,  R.  A  laboratory  experiment  in  recording  in  public 
opinion  interviewing.  International  Journal  of  Opinion  and  Attitude 
Research , 1950,  4,  336-352. 


* Guilford,  J.  P.  The  method  of  paired  comparisons  as  a psychometric  method. 
Psychological  Review,  1928,  494-506. 

Questionnaire  Theory  and  Development,  Scaling  14 

Psychological  Review,  35,  p.  506  R-N 


Guilford,  J.  P.  The  difficulty  of  a  test  and  its  factor  composition. 
Psychometrika,  1941,  67-77. 


Guilford,  J.  P.  Psychometric  methods.  (2nd  ed.)  New  York:  McGraw-Hill, 
1954. 

Textbook  17 

ORA  R-H 


Guilford,  J.  P.  Personality.  New  York:  McGraw-Hill,  1959. 


Guilford,  J.  P.  Fundamental  statistics  in  psychology  and  education.  (4th 
ed.)  New  York:  McGraw-Hill,  1965. 


Guilford,  J.  P.,  & Jorgensen,  A.  P.  Some  constant  errors  in  ratings. 
Journal  of  Experimental  Psychology,  1938,  43-57. 


Scaling  14 

Psych.  Abst.,  12,  #2289  (Rev.  from  rept.)  R-M 


Guilford,  J.  P.,  &  Lacey,  J.  I.  Printed  classification  tests.  Washington, 
D.  C.:  Government  Printing  Office,  1947.  Research  Report  No.  5,  AAF  Psycho- 
logy  Program. 


Guinn,  N.  Development  of  non-cognitive  measures  for  use  in  officer  selec- 
tion  (DF231630).  Lackland  AFB,  Texas:  Air  Force  Human  Resources  Laboratories, 
1973. 

N/A  N/A 

N/A  R-NA 


Gullahorn,  J.  E.,  & Gullahorn,  J.  T.  An  investigation  of  the  effects  of 
three  factors  on  response  to  mail  questionnaire.  Public  Opinion  Quarterly. 
1963,  27,  294-296. 


Gulliksen,  H.  Paired  comparisons  and  the  logic  of  measurement.  Psychologi- 
cal Review,  1946,  199-213. 

Questionnaire  Theory  and  Development  18 

ORA  R-N 


Gulliksen,  H.  The  reliability  of  a speeded  test.  American  Psychologist,
1949,  4,  243. 


Gulliksen,  H.  Theory  of  mental  tests.  New  York:  John  Wiley,  1950. 


Gulliksen,  H.  Measurement  of  subjective  values.  Psychometrika,  1956,  21,
229-244. 


Questionnaire  Theory  and  Development,  Data  Analysis  15

Psychometrika,  33,  p.  229  R-N




Gulliksen,  H.  How  to  make  meaning  more  meaningful.  Contemporary  Psychol- 
ogy, 1958,  3,  115-119. 

Semantic  Differential  Items,  Data  Analysis,  Scaling  3a,  8 

ORA  R-N 


Gulliksen,  H.  Attitudes  of  different  groups  toward  work,  aims,  goals,  and
activities.  Princeton,  N.  J.:  Princeton  University,  Dept.  of  Psychology.
Contract  N00014-67-A-0151-0006.


* Gulliksen,  H.,  & Messick,  S.  (Eds.)  Psychological  scaling:  Theory  and 
applications.  New  York:  John  Wiley  and  Sons,  1969.

Scaling  14 

ORA  R-H 


Gulliksen,  H.,  & Tucker,  L.  R.  A general  procedure  for  obtaining  paired 
comparisons  from  multiple  rank  orders.  Psychometrika,  1961,  197-183.


* Gustav,  A.  Students'  preferences  for  test  format  in  relation  to  their 
test  scores.  Journal  of  Psychology,  1964,  ^(1),  159-164. 

True-False  Items,  Multiple  Choice  Items,  Open-Ended  Items  2,  9,  10 

Psych.  Abst.,  39,  #2890  A-N


Guttman,  L.  A basis  for  scaling  qualitative  data.  American  Sociological 
Review,  1944,  9,  139-150.

Questionnaire  Theory  and  Development,  Scaling  14 

ORA  R-N 


* Guttman,  L.  The  Cornell  technique  for  scale  and  intensity  analysis. 
Educational  and  Psychological  Measurement,  1947  (a),  7,  247-280.

Scaling,  Scoring  14,  8 

ORA  R-M 




Guttman,  L.  Suggestions  for  further  research  in  scale  and  intensity
analysis  of  attitudes  and  opinions.  International  Journal  of  Opinion  and
Attitude  Research,  1947  (b),  30-35.


Guttman,  L.  On  Festinger's  evaluation  of  scale  analysis.  Psychological 
Bulletin,  1949,  451-465.

Literature  Review,  Scaling,  Questionnaire  Theory  and
Development  15,  16

ORA  R-N


Guttman,  L.  Measuring  the  true  state  of  opinion.  In  Ferber,  R.,  & Wales,
H.  (Eds.),  Motivation  and  Behavior.  Homewood,  Ill.:  Richard  D.  Irwin,  1958.


Guttman,  L.  The  basis  for  scalogram  analysis.  In  Stouffer,  S.  A.  (Ed.),
Measurement  and  prediction.  Princeton,  N.  J.:  Princeton  University  Press,
1950.


Guttman,  L.,  & Suchman,  E.  A.  Intensity  and  a zero  point  for  attitude 
analysis.  American  Sociological  Review,  1947,  12,  57-67.

Attitude  Measures,  Data  Analysis,  Questionnaire
Theory  and  Development,  Response  Bias,  Military  Personnel  12,  2,  8

ORA  R-M


Haagen,  C.  H.  Synonymity,  vividness,  familiarity,  and  association  value 
ratings  of  400  pairs  of  common  adjectives.  Journal  of  Psychology,  1949,  27,
453-463.

Adjectives,  Paired  Comparison  Items  6 

ORA  R-M 


* Haire,  M.  Projective  techniques  in  marketing  research.  Journal  of
Marketing,  1950,  14,  649-656.


Hake,  H.  W.,  & Garner,  W.  R.  The  effect  of  presenting  various  numbers  of 
discrete  steps  on  scale  reading  accuracy.  Journal  of  Experimental  Psychol- 
ogy, 1951,  358-366. 

Response  Alternatives  3a,  8 

Journal  of  Experimental  Psychology,  42,  pp.  365-366  (Rev.)  R-M


Hall,  E.  Item  writing  in  empirical  studies.  Educ.  Res.,  1969,  11(3),
223-225. 


Hall,  R.  F.  An  application  of  unfolding  theory  to  the  measurement  of 
attitudes.  Educational  and  Psychological  Measurement,  1970,  30(3),  621-37. 


Haller,  T.  P.  Let's  not  bury  paired  comparisons.  Journal  of  Advertising 
Research,  1966,  6(3),  29-30. 

Paired  Comparison  Items  2 

ORA  R-N 


Halpern,  G.  Item  arrangement  and  bias  in  an  interest  test.  Research 
Bulletin  67-39.  Princeton,  New  Jersey:  Educational  Testing  Service,  1967.

N/A  N/A 

N/A  T-H 


Halpern,  R.  S.  Some  observations  about  attitudes,  attitude  measurement, 
and  behavior.  In  Adler,  L.,  & Crespi,  J.  (Eds.),  Attitude  Research  on  the
Rocks.  Chicago:  American  Marketing  Association,  1968.


Hambleton,  R.  K.  The  effects  of  item  order  and  anxiety  on  test  performance 
and  stress  (ERIC  Document  Reproduction  Service,  ED  017  960).  Paper  presented
at  the  annual  meeting  of  the  American  Educational  Research  Association,  Chicago, 
1968. 

Achievement  Measures  18 

ERIC  Document  Reproduction  Service,  ED  017  960  A-M 




* Hamel,  L.,  & Reif,  H.  G.  Should  attitude  questionnaires  be  signed?  Person- 
nel Psychology,  1952,  5(2),  78-91. 


Anonymous  Respondent,  Investigator  Error  11,  9

Psych.  Abst.,  27,  #4646


Hamil,  P.  N.  Word  meanings  and  self-descriptions.  Journal  of  Social
Psychology,  1969,  79,  51-54.


Hamilton,  C.  H.  Bias  and  error  in  multiple-choice  tests.  Psychometrika,
1950,  15,  151-168. 


Data  Analysis,  Response  Alternatives  8,  3a

Psych.  Abst.,  #693


Hammock,  J.  C.  Anxiety  scales  for  use  in  army  training  research  (AD480314).
Human  Resources  Research  Organization,  Training  Methods  Division,  Staff 
Memorandum,  1954. 


Hammond,  K.  R.  Measuring  attitudes  by  error-choice:  An  indirect  method. 
Journal  of  Abnormal  and  Social  Psychology,  1948,  _9>  38. 


Hancock,  J.  W.  An  experimental  study  of  limiting  response  on  attitude 
scales.  In  Remmers,  H.  H.  (Ed.),  Further  Studies  in  Attitudes,  Series  III.
Studies  in  Higher  Education,  XXXIV.  Lafayette,  Ind.:  Purdue  University,
1938.  Pp.  142-148. 


Hancock,  J.  W.  An  experimental  study  of  four  methods  of  measuring  unit 
cost  of  obtaining  attitudes  toward  a retail  store.  Journal  of  Applied 
Psychology,  1940,  213-230.




Hand,  J.  Comment  on  "Acquiescence,  social  desirability  and  inhibition 
reflected  by  'response  set'  scales."  Psychological  Reports,  1963,  13(3),
662. 


Hand,  J.  Measurement  of  response  sets.  Psychological  Reports,  1964,  14(3),
907-913. 


Handy,  U.,  & Lentz,  T.  F.  Item  value  and  test  reliability.  Journal  of 
Educational  Psychology,  1934,  703-708. 

Reliability  5,  14 

ORA  R-N 


Handyside,  J.  A.  A general  introduction  to  attitude  scaling.  London: 
Market  Research  Society,  1960. 


Hanley,  C.  Deriving  a measure  of  test-taking  defensiveness.  Journal  of 
Consulting  Psychology,  1957,  21,  391-397.


Hanley,  C.  Responses  to  the  wording  of  personality  test  items.  Journal 
of  Consulting  Psychology,  1959,  261-265. 


Hanley,  C.  Personality  item  difficulty  and  acquiescence.  Journal  of 
Applied  Psychology,  1965,  205-208.

Response  Bias,  Investigator  Error,  Personality  Measures  12,  8 

Psych.  Abst.,  #12286  A-H


Hansen,  M.  H.,  Horowitz,  W.  N.,  & Madow,  W.  G.  Sample  survey  methods  and 
theory.  Vol.  I.  New  York:  John  Wiley,  1960.


Hansen,  M.  H.,  & Steinberg,  J.  Control  of  error  in  surveys.  Biometrics,
1956,  12,  462-474. 




* Hanson,  R.  H.,  & Marks,  E.  S.  Influence  of  the  interviewer  on  the
accuracy  of  survey  results.  Journal  of  the  American  Statistical  Association,
1958,  635-655.

Clarity,  Interviews,  Investigator  Error  4,13 

Psych.  Abst.,  33,  #10132


Hardin,  E.,  & Hershey,  G.  L.  Accuracy  of  employee  reports  on  changes  in 
pay.  Journal  of  Applied  Psychology,  1960,  269-275.


Harding,  F.  D.,  & Madden,  J.  M.  Analysis  of  some  aspects  of  the  Air
Force  position  evaluation  system.  USAF  WADD  Personnel  Lab.  Technical  Note
No.  60-143,  1960.

N/A  N/A

N/A  T-M

Harding,  F.  D.,  Madden,  J.  M.,  & Colson,  K.  Analysis  of  a job  evaluation 
system.  Journal  of  Applied  Psychology,  1960,  44,  354-357. 


Harding,  L.  W.  A value-type  generalizations  test.  Journal  of  Social 
Psychology,  1944  (a),  19,  53-79.


Harding,  L.  W.  A value-type  problemmaire.  Journal  of  Social  Psychology,
1944  (b),  19,  115-144. 


Hardt,  F.  A.  Comparability  and  accuracy  of  interview  and  questionnaire.
Catalogue  of  Selected  Documents  in  Psychology,  1971,  ^ (1). 


Hardt,  R.  H.,  & Bodine,  G.  E.  Development  of  self-report  instruments  in
delinquency  research:  A conference  report.  Syracuse,  New  York:  Syracuse 
University,  1965. 


Hare,  P.  A.  Interview  responses:  Personality  or  conformity?  Public  Opinion 
Quarterly,  1960,  24(4),  679-685.


Attitude  Measures,  Interviews,  Response  Bias,  Validity,
Investigator  Error  7,  10,  12,  9

ORA  R-M


Harmon,  L.  W.  Sexual  bias  in  interest  measurement.  Measurement  and 
Evaluation  in  Guidance,  1973,  3(^)  , 496-501. 


Harms,  T.  Evaluating  settings  for  learning.  Young  Children,  1970,  25(5),
304-308. 


Harris,  F.  J.,  Howell,  M.  A.,  & Newman,  S.  H.  Forced  choice  tetrads-effect
of  scoring  procedure  and  key  length  on  validity  and  reliability.  Educational
and  Psychological  Measurement,  1956,  16(4),  454-464.


Harris,  M.,  & Connelly,  G.  M.  Introducing  a symposium  on  interviewing
problems.  International  Journal  of  Opinion  and  Attitude  Research,  1948, 
2(1),  70-84. 


Harris,  R.  J.  Deterministic  nature  of  probabilistic  choices  among  identi- 
fiable stimuli.  Journal  of  Experimental  Psychology,  1969,  7£(3,  Pt.  1), 
552-560. 


Harrison,  R.  L.  Workers'  perceptions  and  job  success:  Use  of  forced-choice
questionnaires.  Personnel  Psychology,  1959,  12,  619-25.

N/A  N/A

ORA  R-N


Hart,  C.  W.  Bias  in  interviewing  in  studies  of  opinions,  attitudes,  and 
consumer  wants.  Proceedings  of  the  American  Philosophical  Society,  1948,
92(5),  399-404. 


Response  Bias,  Investigator  Error,  Interviews  12,  13


* Hart,  G.  L.,  Faust,  R.  A.,  Jr.,  Rowland,  G.  E.,  & Lucier,  R.  O.
Attitudes  of  troops  in  the  tropics.  Volume  II.  Methodological  implications.
Haddonfield,  New  Jersey:  Rowland,  1964.  R&C  Report  No.  64-19.


Attitude  Measures,  Projective  Items,  Check  List,
Military  Personnel,  Respondent's  Motivation,  Rating
Scales,  Semantic  Differential  Items,  Open-Ended  Items  2,  11,  9,  3g


Hartson,  L.  D.  Influence  of  level  of  motivation  on  the  validity  of
intelligence  tests.  Educational  and  Psychological  Measurement,  1945,  5,  273-283.


Hasler,  K.  R.  Importance  of  descriptive  validity  in  psychological  measure- 
ment. Personnel  Journal,  1972,  ^ (1),  12-16. 


Hastorf,  A.  H.,  & Piper,  G.  W.  A note  on  the  effect  of  explicit
instructions  on  prestige  suggestion.  Journal  of  Social  Psychology,  1941,  33,
289-293.


Attitude  Measures,  Response  Bias,  Investigator  Error  12,  13,  7

Psych.  Abst.,  26,  #6069


Hauck,  M.  Is  survey  postcard  verification  effective?  Public  Opinion 
Quarterly,  1969,  33(1),  117-120.


Hawkins,  L.,  & Coble,  J.  A.  The  problem  of  response  error  in  interviews -
Chapter  3.  In  Lansing,  J.  B.,  Withy,  S.  B.,  Wolfe,  A.  C.,  et  al.,  Working
papers  on  survey  research  in  poverty  areas.  Ann  Arbor,  Mich.:  University 
of  Michigan,  Institute  for  Social  Research,  Survey  Research  Center,  1971. 


Hay,  N.  Establishing  factor  scales.  Personnel,  1946,  23(2).




Heilbrun,  A.  B.,  & Goodstein,  L.  D.  Relationships  between  personal  and 
social  desirability  sets  and  performance  on  the  Edwards  Personal  Preference 
Schedule.  Journal  of  Applied  Psychology,  1959,  43,  302-305.

Response  Bias  2,  12

Psych.  Abst. , 34,  #5719  A-N 


Heise,  D.  R.  Some  methodological  issues  in  semantic  differential  research. 
Psychological  Bulletin,  1969,  72,  406-422.


Heller,  K.,  Davis,  J.  D.,  & Myers,  R.  A.  The  effects  of  interviewer  style
in  a standardized  interview.  Journal  of  Consulting  Psychology,  1966,  30(6),
501-503.

Helme,  W.  H.  Validation  of  interest  scales  for  construction  and  mechanical
jobs.  Washington,  D.  C.:  Army  Personnel  Research  Office,  1967.

Interest  Measures  18 

NTIS  A-N 


Helme,  W.  H.,  Kotula,  L.  J.,  & Fitch,  D.  J.  Preliminary  evaluation  of
measures  to  predict  Army  reenlistment.  USA  TAG,  R&D  Command,  Human  Factors 
Research  Branch,  1960.  Technical  Research  Note  No.  110. 

Military  Personnel,  Scoring  14 

Psych.  Abst.  , #2LD14K  A-N 


Helson,  H.  An  experimental  investigation  of  the  effectiveness  of  the
"big  lie"  in  shifting  attitudes.  Journal  of  Social  Psychology,  1958,  48,
51-60.


Helson,  H.,  & Kozaki,  A.  Anchor  effects  using  numerical  estimates  of  simple 
dot  patterns.  Perception  and  Psychophysics,  1968,  4,  163-164.

Investigator  Error,  Clarity  3f,  4

Perception  and  Psychophysics,  4,  p.  163  (Rev.  from  rept.)  R-H




Helson,  H.,  Michels,  W.  C.,  & Sturgeon,  A.  The  use  of  comparative  rating
scales  for  the  evaluation  of  psychophysical  data.  American  Journal  of
Psychology,  1954,  67,  321-326.


Hembree,  H.  W.  Dimensionality  of  soldier  acceptance:  An  approach  to
critical  research.  College  Park,  Maryland:  University  of  Maryland.  Technical
Report  No.  10,  QM  Contract  DA  44-109-QM-129.

N/A  N/A 

N/A  T-M 


Hemkee,  H.  W.,  & McDermott,  W.  G.  An  experimental  investigation  of
synthetic  ruffs.  Quartermaster  General,  R&D  Division,  Environmental  Protection
Branch,  1952.  Report  No.  196-DA.


N/A  N/A 

N/A  T-M 


Hendrick,  C.,  et  al.  Effectiveness  of  ingratiation  tactics  in  a cover  letter 
on  mail  questionnaire  response.  Psychonomic  Science,  1972,  2^  (6),  349-351, 


Hendrickson,  G.  F.  The  effect  of  differential  option  weighting  on  multiple- 
choice  objective  tests.  Journal  of  Educational  Measurement,  1971,  8,  291-296. 


Heneman,  H.  G.,  Jr.,  & Paterson,  D.  G.  Refusal  rates  and  interviewer  quality. 
International  Journal  of  Opinion  and  Attitude  Research,  1949,  2(3),  392-398.


Henmon,  V.  A.  C.  The  relation  of  the  time  of  a judgment  to  its  accuracy. 
Psychological  Review,  1911,  18,  186-201.


Henry,  A.  F.,  & Borgatta,  E.  F.  A consideration  of  some  problems  of
content  identification  in  scaling.  Public  Opinion  Quarterly,  1956,  20,  457-469.


Henry,  H.  Belson's  studies  in  readership.  Journal  of  Advertising  Research, 
2(2),  9-14. 


* Heron,  A.  The  effects  of  real-life  motivation  on  questionnaire  response. 
Journal  of  Applied  Psychology,  1956,  40(2),  65-68.


Card  Sorts,  Personality  Measures,  Respondent's
Motivation,  Investigator  Error  3g,  7,  10,  11

Psych.  Abst.,  31,  #6731  (Rev.  from  rept.)


Hess,  R.  D.,  & Hink,  D.  L.  A comparison  of  forced  vs.  free  Q-sort  procedure.
Journal  of  Educational  Research,  1959,  5^,  83-90.

Card  Sorts  2 

Psych.  Abst.,  35,  #786  (Rev.  from  rept.)  R-H


Hevner,  K.  An  empirical  study  of  three  psychophysical  methods.  Journal  of 
General  Psychology,  1930,  4,  191-212. 


Hevner,  K.  A method  of  correcting  for  guessing  in  true-false  tests  and 
empirical  evidence  in  support  of  it.  Journal  of  Social  Psychology,  1932,  3,
359-362. 


* Hicks,  L.  E.  Some  procedures  of  ipsative,  normative,  and  forced-choice
normative  measures.  Psychological  Bulletin,  1970,  74,  167-184. 


Forced  Choice  Items,  Investigator  Error,
Literature  Review,  Scoring  2,  13,  8

Psychological  Bulletin,  74,  p.  167  (Rev.  from  rept.)


Highland,  R.  W.,  & Berkshire,  J.  R.  A methodological  study  of  forced-choice
performance  ratings.  Bulletin  No.  51-59.  USAF  Human  Resources  Research 
Center,  1951. 


Hildum,  D.  C.,  & Brown,  R.  W.  Verbal  reinforcement  and  interviewer  bias. 
Journal  of  Abnormal  and  Social  Psychology,  1956,  ^(1),  108-111, 

Interviews,  Investigator  Error,  Respondent's  Motivation  13,  11 

Psych.  Abst. , 3^,  #1480  A-M 

Hill,  R.  J.  A note  on  inconsistence  in  paired  comparison  judgments. 
American  Sociological  Review,  1953,  18,  564-566.

Paired  Comparison  Items,  Response  Bias  8,  10,  12 

Psych.  Abst.,  28,  #5185  (Rev.  from  rept.)  R-H


Hillmer,  M.  L.,  Jr.  Social  desirability  in  a two-choice  personality  scale.
(Doctoral  dissertation,  University  of  Washington)  Seattle,  Washington,  June  1958.

Response  Bias,  Personality  Measures,  Rating  Scales,
True-False  Items  12,  2

ORA  A-NA


Hills,  S.  L.  Research  note  increasing  the  response  rate  for  structured
interviews  in  community  research.  American  Behavioral  Scientist,  1968,  11(3),
47-48.


Hilton,  D.  W.  Response  sets  as  they  relate  to  item  direction  in  an
adjective  rating  instrument.  Dissertation  Abstracts  International,  1970,
31(5-A),  2182.


Himelstein,  P.,  & Blaskovics,  T.  L.  Prediction  of  an  intermediate  criterion
of  combat  effectiveness  with  a biographical  inventory.  Journal  of  Applied
Psychology,  1960,  166-168.


* Hinckley,  E.  D.  The  influence  of  individual  opinion  on  construction  of  an
attitude  scale.  Journal  of  Social  Psychology,  1932  (a),  3,  283-296.

Attitude  Measures,  Card  Sorts,  Investigator  Error,  Scaling  12,  10


Hinckley,  E.  D.  The  influence  of  individual  opinion  on  construction  of  an
attitude  scale.  Chicago:  University  of  Chicago  Library,  1932  (b).


Hinckley,  E.  D.  A follow-up  study  on  the  influence  of  individual  opinion 
on  the  construction  of  an  attitude  scale.  Journal  of  Abnormal  and  Social 
Psychology , 1963,  b]_,  290-292. 


* Hinrichs,  J.  R.,  & Gatewood,  R.  D.  Differences  in  opinion-survey  response
patterns  as  a function  of  different  methods  of  survey  administration.  Journal
of  Applied  Psychology,  1967,  51(6),  497-502.

Instrument  Format,  Investigator  Error  3g,  7 

Journal  of  Applied  Psychology,  p.  497  (Rev.  from  rept.)  R-H 


Hochstim,  J.  R.  A critical  comparison  of  three  strategies  of  collecting 
data  from  households.  Journal  of  the  American  Statistical  Association,  1967, 
62(319),  976-989.


Hofstaetter,  P.  R.  Importance  and  actuality.  International  Journal  of 
Opinion  and  Attitude  Research,  1951,  31-52. 

Attitude  Measures,  Multiple  Choice  Items,  Rating  3a,  10 

Scales,  Data  Analysis,  Question  Stem 

Psych.  Abst. , 26,  #4738  (Rev.  from  rept.)  R-N 


Hofstaetter,  P.  The  actuality  measure  in  the  study  of  public  opinion. 
Journal  of  Applied  Psychology,  1953,  281-287. 




Hofstee,  W.  K.  Secular  trends  in  an  adjective  checklist.  Educational
and  Psychological  Measurement,  1966,  26,  363-367.


Instrument  Format,  Paired  Comparison  Items,  Response  Alternatives  3c

Educational  and  Psychological  Measurement,  26,  p.  367  (Rev.  from  rept.)  R-H


Hofstee,  W.  K.  Comparative  vs.  absolute  judgments  of  trait  desirability. 
Educational  and  Psychological  Measurement,  1970,  639-646. 


Hohle,  R.  H.  Some  empirical  applications  and  evaluations  of  two  models 
for  discriminability  scales:  Pair  comparisons  and  successive  intervals.
Dissertation  Abstracts,  1962,  2^(7),  2476-2477. 


Holdaway,  E.  A.  Different  response  categories  and  questionnaire  response 
patterns.  The  Journal  of  Experimental  Education,  1971,  4^,  57-60. 


Instrument  Format,  Response  Alternatives  3g,  3f

Journal  of  Experimental  Education,  ^(2),  p.  57  (Rev.  from  rept.)  R-H


Holdrege,  F.  E.,  Jr.  A combination  of  forced  choice  and  check  list  rating
scales  for  the  evaluation  of  instrument  flying  proficiency.  Dissertation
Abstracts,  March  1960,  20,  3817-3818.


Hollingworth,  H.  L.  Experimental  studies  in  judgment.  Archives  of
Psychology,  New  York,  1913,  No.  29.




* Hollis,  J.  R.  Evaluation  of  a vehicle  rating  scale.  U.  S.  Army  Ordnance 
Human  Engineering  Laboratory  Technical  Memo  No.  8,  1954. 

Military  Personnel,  Investigator  Error  9 

Psych.  Abst.,  31,  #1903  A-H


Holzinger,  K.  J.  The  reliability  of  a single  test  item.  Journal  of  Educa- 
tional Psychology,  1932,  211-417. 


* Horn,  J.  L.,  & Cattell,  R.  B.  Vehicles,  ipsatization,  and  the
multiple-method  measurement  of  motivation.  Canadian  Journal  of  Psychology,  1965,
(4),  265-279.

Personality  Measures,  Response  Bias,  Reliability  2,  12 

Psych.  Abst.,  40,  #2903  (Rev.)  A-M


Horowitz,  J.  L.,  et  al.  Repeated  measures  effects  in  racial  attitude
measurement.  College  Park,  Md.:  Maryland  University,  Cultural  Study  Center,
1972.  Report  No.  RR-8-72.


Horst,  A.  P.  A method  for  determining  the  absolute  affective  value  of  a
series  of  stimulus  situations.  Journal  of  Educational  Psychology,  1932,  23,
418-440.


Horst,  P.  The  chance  element  in  the  multiple-choice  test  item.  Journal
of  General  Psychology,  1932  (a),  6,  209-211.


Horst,  P.  The  difficulty  of  multiple-choice  test  item  alternatives.
Journal  of  Experimental  Psychology,  1932  (b),  15,  469-472.


Horst,  P.  Obtaining  a composite  measure  of  a number  of  different  measures
of  the  same  attribute.  Psychometrika,  1936,  1,  53-60.




* Horst,  P.,  & Wright,  C.  E.  The  comparative  reliability  of  two  techniques 
of  personality  appraisal.  Journal  of  Clinical  Psychology,  1959,  15(4),  388- 
391. 

Paired  Comparison  Items,  Rating  Scales,  Reliability,  2,  5,  12 

Personality  Measures,  Forced  Choice  Items,  Response  Bias 

ORA 


Hovland,  C.  I.,  & Sherif,  M.  Judgmental  phenomena  and  scales  of  attitude
measurement:  Item  displacement  in  Thurstone  scales.  Journal  of  Abnormal
and  Social  Psychology,  1952,  822-832.


* Howe,  E.  S.  Further  comparison  of  two  short-item  derivatives  of  the 
Taylor  Manifest  Anxiety  Scale.  Psychological  Reports,  1960,  6,  21-22. 

Forced-Choice  Items,  Response  Bias  2,  12 

Psych.  Abst.,  34,  #7848  A-H

* Hubbard,  A.  W.  Phrasing  questions.  Journal  of  Marketing,  1950,  1^,  48-56. 

Instrument  Format,  Question  Stem,  Reliability  3g,  4,  17

Potter,  Sharpe,  Hendee,  and  Clark,  1972  (Rev.  from  rept.)  R-M


Hubbard,  F.  W.  Questionnaires.  Review  of  Educational  Research,  1939,  9, 
502-507. 


Hubbard,  F.  W.  Questionnaires,  interviews,  personality  schedules.  Review 
of  Educational  Research,  1942,  12  (5),  534-541. 


Literature  Review  16 

Potter,  Sharpe,  Hendee  & Clark,  1972  A-H 


* Hughes,  G.  D.  A new  tool  for  sales  managers.  Journal  of  Marketing  Research, 
1964,  1,  32-38.

Check  List,  Semantic  Differential  Items  17 

ORA 


* Hughes,  G.  D.  Selecting  scales  to  measure  attitude  change.  Journal  of
Marketing  Research,  1967,  4,  85.

Attitude  Measures,  Check  List,  Rating  Scales,  Respondent's
Motivation,  Semantic  Differential  Items,  Reliability,
Validity  2,  11

r»  ti 


* Hughes,  G.  D.  Some  confounding  effects  of  forced-choice  scales.  Journal 
of  Marketing  Research,  1969,  6,  223-226. 

Forced  Choice  Items,  Instrument  Format,  Literature  1,  2,  3a 

Review,  Response  Alternatives,  Semantic  Differential  Items 

ORA 


Humm,  D.  G.,  & Humm,  K.  A.  Compensations  for  subjects'  response-bias  in
a measure  of  temperament.  American  Psychologist,  1947,  305. 


Humm,  D.  G.,  & Humm,  K.  A.  Notes  on  "The  validity  of  personality  inven- 
tories in  military  practice,”  by  Ellis  and  Conrad.  Psychological  Bulletin, 
1949,  303-306. 


Hummel,  R.  C.  Interviewee  responsiveness  as  a function  of  interviewer 
method.  Dissertation  Abstracts,  1959,  1^,  1846. 


Humphrey,  R.  L.  Troop-community  relations  research  in  Korea:  Educational
materials  (AD  865  725L).  Silver  Spring,  Maryland:  American  Institutes  for
Research,  1969.  Report  No.  AIR-R69-15,  AIR-E35-S/69-EM.


N/A  18

ORA  R-HA


Hunt,  W.  A.  Anchoring  effects  in  judgment.  American  Journal  of  Psychology,
1941,  395-403.

Rating  Scales  3f

American  Journal  of  Psychology,  54,  p.  403  R-H


* Hunt,  W.  A.,  & Volkmann,  J.  The  anchoring  of  an  affective  scale.  American
Journal  of  Psychology,  1937,  49,  88-92. 

Rating  Scales  3f 

ORA  R-M 


Hurd,  A.  W.  Comparisons  of  short  answer  and  multiple-choice  tests  covering
identical  subject  content.  Journal  of  Educational  Research,  1932,  28-30. 


Husek,  T.  R.  Acquiescence  as  a factor  in  test-taking  behavior  and  as  a 
personality  characteristic.  Dissertation  Abstracts,  1959,  1^,  2650-2651. 


Husek,  T.  R.  Acquiescence  as  a response  set  and  as  a personality  character- 
istic. Educational  and  Psychological  Measurement,  1961,  21(2),  295-307. 


Hutchinson,  B.  Some  problems  of  measuring  the  intensiveness  of  opinion 
and  attitudes.  International  Journal  of  Opinion  and  Attitude  Research,
1949,  3(1),  123-131. 


Hyman,  H.  Do  they  tell  the  truth?  Public  Opinion  Quarterly,  1944,  8, 
557-559. 


Hyman,  H.  Inconsistencies  as  a problem  in  attitude  measurement.  Journal 
of  Social  Issues,  1949(a),  ^(3),  32-42. 

Attitude  Measures,  Investigator  Error  13 

Psych.  Abst.,  #1018  A-N 


Hyman,  H.  Isolation,  measurement,  and  control  of  interview  effect. 
Social  Sciences  Research  Council,  1949  (b) , 3,  15-17. 


Hyman,  H.  H.  Problems  in  the  collection  of  opinion  research  data. 
American  Journal  of  Sociology,  1950,  362-370. 


Hyman,  H.  H.  Survey  design  and  analysis:  Principles,  cases,  and  procedures.
Glencoe,  Ill.:  Free  Press,  1955.




Hyman,  H.  H.,  Cobb,  W.  J.,  Feldman,  J.  J.,  Hart,  C.  W.,  & Stember,  C.  H.
(Eds.)  Interviewing  in  social  research.  Chicago:  University  of  Chicago  Press,
1954.


Industrial  Relation  Center,  University  of  Minnesota.  Validity  of  work 
histories  obtained  by  interview.  The  Minnesota  Studies  in  Vocational 
Rehabilitation,  1961. 


The  interview  in  social  research.  American  Journal  of  Sociology,  1956,  62
(2),  137-194.


Isard,  E.  S.  The  relationship  between  item  ambiguity  and  discriminating 
power  in  a forced-choice  scale.  Journal  of  Applied  Psychology,  1956, 
266-268. 

Clarity,  Response  Bias,  Forced  Choice  Items,  Rating
Scales  4,  12

ORA  R-M


Isaac,  S.,  & Michael,  W.  B.  Handbook  in  research  and  evaluation.  San
Diego,  Calif.:  Robert  R.  Knapp,  1971. 


Ito,  R.  An  analysis  of  response  errors - A case  study.  Journal  of  Business,
1963,  36(4),  440-447. 


Ivens,  S.  H.  Nonparametric  item  evaluation  index.  Educational  and
Psychological  Measurement,  1971,  31,  831-842.


Izard,  C.  E.,  & Rosenberg,  N.  Effectiveness  of  a forced-choice  leadership 
test  under  varied  experimental  conditions.  Educational  and  Psychological 
Measurement,  1958,  18,  57-62.

Military  Personnel,  Forced  Choice  Items,  Reliability,
Validity,  Response  Bias  2,  12

Psych.  Abst.,  33,  #7293  A-M


Jackson,  C.  A.,  Woods,  A.  R.,  Stockston,  T.  W.,  & Schoof,  M.  W.  OSD
Reserve  Component  Study,  Ft.  Hood,  Texas.  Test  6,  Phase  I.  Criteria
Validation  Report.  Washington,  D.  C.:  Department  of  the  Army,  1972.  4 vols.

Military  Personnel,  Questionnaire  Theory  and
Development  17


Jackson,  D.  N.  Acquiescence  response  styles:  Problems  of  identification 
and  control.  In  Berg,  I.  A.  (Ed.),  Response  set  in  personality  assessment. 
Chicago:  Aldine,  1967.  Pp.  71-114.


Jackson,  D.  N.,  & Guthrie,  G.  M.  Multitrait-multimethod  evaluation  of  the
Personality  Research  Form.  Proceedings  of  the  American  Psychological
Association,  1968,  177-178.


* Jackson,  D.  N.,  & Messick,  S.  J.  A note  on  "ethnocentrism"  and  acquiescent 
response  sets.  Journal  of  Abnormal  and  Social  Psychology,  1957,  54,  132-134. 


Response  Bias,  Attitude  Measures,  Instrument  Format  12,  14

Psych.  Abst.,  33,  #1279


Jackson,  D.  N.,  & Messick,  S.  Response  styles  and  the  assessment  of
psychopathology.  In  Messick,  S.,  & Ross,  J.  (Eds.),  Measurement  in  personality  and
cognition.  New  York:  John  Wiley,  1962.  Pp.  129-155.


Jackson,  D.  N.,  & Messick,  S.  A distinction  between  judgments  of  frequency 
and  of  desirability  as  determinants  of  response.  Educational  and  Psychologi- 
cal Measurement,  1969,  29,  273-293. 


* Jackson,  D.  N.,  & Minton,  H.  L.  A forced-choice  adjective  preference
scale  for  personality  assessment.  Psychological  Reports,  1963,  J^(2),  515-520.


Forced-Choice  Item,  Reliability,  Response  Bias,
Paired  Comparison  Items,  Check-List,  Personality  Measures  2,  12,  14

Psych.  Abst.,  #4303




Jackson,  D.  N.,  & Morf,  M.  E.  An  analysis  of  two  response  styles:  true 
responding  and  item  endorsement.  Educational  and  Psychological  Measurement. 
1972,  32(2),  329-53.


Jackson,  R.  M.,  & Rothney,  J.  W.  A comparative  study  of  the  mailed  ques- 
tionnaire and  the  interview  in  follow-up  studies.  Personnel  and  Guidance 
Journal,  1961,  39,  569-571.


Jackson,  D.  N.,  et  al.  An  evaluation  of  forced-choice  and  true-false  formats 
in  personality  assessment.  Princeton,  New  Jersey:  Educational  Testing  Service, 
1971.  Report  No.  RB-71-67. 

N/A  N/A 

N/A  A-H 


Jackson,  D.  N.,  et  al.  An  evaluation  of  forced-choice  and  true-false  item 
formats  in  personality  assessment.  Journal  of  Research  in  Personality,  1973, 
7(1),  21-30.


Jacobs,  S.  S.  An  experimental  analysis  of  answer-changing  behavior  on 
objective  tests  (ERIC  Document  Reproduction  Service,  ED  048  345).  Paper 
presented  at  the  Annual  Meeting  of  the  American  Educational  Research  Asso- 
ciation, New  York,  New  York,  1971. 

Achievement  Measures  18 

ERIC  Document  Reproduction  Service,  ED  048  345  A-N 


Jacobs,  S.  S.  Answer  changing  on  objective  tests:  Some  implications  for 
test  validity.  Educational  and  Psychological  Measurement,  1972,  32(4),
1039-1044. 

Achievement  Measures  18 

ERIC  Document  Reproduction  Service,  EJ  069  060  A-N 




Jacobs,  T.  O.,  et  al.  A guide  for  developing  questionnaire  items.
Alexandria,  Virginia:  Human  Resources  Research  Organization,  1970.  Report
No.  HUMRRO-RBP-D4-70-1.

Textbook

ORA  R-H


Jacobson,  E.  A.  A comparison  of  the  effects  of  instructions  and  models 
upon  interview  behavior  of  high-dependent  and  low-dependent  subjects. 
Dissertation  Abstracts,  1969,  ^(9-B) , 3485. 


* Jacoby,  J.,  & Matell,  M.  Three-point  Likert  scales  are  good  enough. 
Journal  of  Marketing  Research,  1971,  8,  495-500. 

Instrument  Format,  Rating  Scales,  Literature  Review,
Response  Alternatives,  Reliability,  Validity  3a

ORA  R-H


* Jahn,  J.  A.  Some  further  contributions  to  Guttman's  theory  of  scale
analysis.  American  Sociological  Review,  1951,  16,  233-239.

Data  Analysis,  Scaling  15 

Psych . Abst . , 27 , #1069  A-N 


Jahoda,  M.,  Deutsch,  M. , & Cook,  S.  W.  Data  collection,  the  question- 
naire and  interview  approach.  In  Research  methods  in  social  relations  with 
especial  reference  to  prejudice.  New  York:  Dryden  Press,  1951.  Pp.  151-208.


Jain,  S.  K.  Concept  of  soldiers'  efficiency.  Psychological  Researches. 
1966,  1(1-2),  15-23.


Jakobovits,  L.  A.  The  affect  of  symbols:  Towards  the  development  of  a 
cross-cultural  graphic  differential.  International  Journal  of  Symbology, 
1969,  1(1),  28-52. 




Jarrett,  R.  F.,  & Sherriffs,  A.  C.  Forced-choice  versus  permissive  tech- 
niques in  obtaining  responses  to  attitude  questionnaires.  Journal  of 
General  Psychology,  1956,  203-206.


Forced  Choice  Items,  Investigator  Error  7 

Psych.  Abst. , 33,  #8127  A-H 


Jenkins,  J.  G.  Characteristics  of  the  question  as  determinants  of  dependability.
Journal  of  Consulting  Psychology,  1941,  164-169.

Clarity,  Investigator  Error,  Instrument  Format  3g,  13,  4

ORA  R-M


Jenkins,  J.  J.,  Russell,  W.  A.,  & Suci,  J.  An  atlas  of  semantic  profiles 
for  360  words.  American  Journal  of  Psychology,  1958,  71,  688-699.

Semantic  Differential  Items,  Reliability  14 

Report  Summary  R-M 


Jensen,  J.  A.,  &  Schmitt,  J.  A.  The  influence  of  test  title  on  test
response.  Journal  of  Educational  Measurement,  1970,  7,  241-246.

Investigator  Error,  Personality  Measures,  13,  7,  11 

Respondent's  Motivation 

Journal  of  Educational  Measurement,  7,  p.  241  R-NA 


Jensen,  M.  B.  An  evaluation  of  three  methods  of  presenting  true-false 
examinations.  School  and  Society,  1930,  3^j  675-677. 

Instrument  Format,  True-False  Items,  Reliability  3g , 7 

ORA  R-H 


Johnson,  D.  M.  Confidence  and  speed  in  the  two-category  judgment. 
Archives  of  Psychology,  New  York,  1939,  No.  241. 


Psychophysical  Measures,  Instrument  Length 


5 


* Johnson,  D.  M.  Reanalysis  of  experimental  halo  effects.  Journal  of 
Applied  Psychology,  1963,  47,  46-47.

Data  Analysis,  Investigator  Error,  Rating  Scales  7,  12 

Journal  of  Applied  Psychology,  47,  46  (Rev.)  R-H


* Johnson,  D.  M. , & Vidulich,  R.  N.  Experimental  manipulation  of  the  halo 
effect.  Journal  of  Applied  Psychology,  1956,  130-134. 

Investigator  Error  12 

Psych.  Abst.  , #5792  A-M 


* Johnson,  K.  K.  Company  interviewers  rate  job  applicants.  Personnel  and 
Guidance  Journal,  1958,  36,  422-424. 

Interviews,  Investigator  Error  9,  13

Psych.  Abst.  , 33,  #7000  A-N 


Johnson,  R.  H.,  & Bond,  G.  L.  Reading  ease  of  commonly  used  tests. 
Journal  of  Applied  Psychology,  1950,  34,  319-324. 

Clarity  4,9 


Psych.  Abst.,  26,  #299  (Rev.)  A-M


Jones,  E.  E.,  &  Sigall,  H.  The  bogus  pipeline:  A  new  paradigm  for  measuring
affect  and  attitude.  Psychological  Bulletin,  1971,  76,  349-364.


Jones,  L.  V.,  & Bock,  R.  D.  The  Measurement  and  Prediction  of  Judgment  and 
Choice.  San  Francisco:  Holden-Day,  1968.


Jones,  L.  V.,  & Jeffrey,  T.  E.  Development  of  suitable  rating  scales  for 
measuring  the  subjective  reactions  of  troops  using  QM  items  under  actual 
field  test  conditions.  Chapel  Hill,  N.  C.:  University  of  North  Carolina, 
Psychometric  Laboratory,  1959. 




* Jones,  L.  V.,  &  Thurstone,  L.  L.  The  psychophysics  of  semantics:  An
experimental  investigation.  Journal  of  Applied  Psychology,  1955,  39,  31-36.

Adjectives  ^ 

ORA  R-H


Jones,  L.  V.,  Thurston,  L.  L.,  & Maston,  R.  Measurement  of  food  preference 
and  prediction  of  consumer  choice.  QM  Food  and  Container  Institute,  1953. 
Project  No.  7-84-15-007.


* Jones,  M.  B.  The  deliberate  use  of  a  set  to  'fake'  in  personality  questionnaires.
Pensacola,  Fla.:  Naval  School  of  Aviation  Medicine,  1959.  Report  No.  29.


Response  Bias,  Personality  Measures  12

Jones,  1958  A-M


* Jones,  R.  R.  Differences  in  response  consistency  and  subject's  preferences 
for  three  personality  inventory  response  formats.  Proceedings  of  the  76th 
Annual  Convention  of  the  American  Psychological  Association.  1968,  3,  247- 
248.  (Summary) 

Multiple-Choice  Items,  Response  Alternatives,  2,  3a,  11 

Respondent's  Motivation,  Reliability,  True-False  Items

ORA  R-H 


Jones,  S.  Process  testing  - an  attempt  to  analyze  reasons  for  students' 
responses  to  test  questions.  Journal  of  Educational  Research,  1953,  46,
525-534. 

Achievement  Measures  18 

Psych.  Abst . , 2^,  #3292  A-N 

* Jury,  P.  A.  The  relation  of  sample  demographic  characteristics  to  job 
satisfaction.  University  of  Minnesota,  Technical  Report  No.  9001.  1971. 

Investigator  Error  9 

DDC,  # AD  724  809  A-M 




Kahn,  R.  L.,  &  Cannell,  C.  F.  The  dynamics  of  interviewing.  New  York:
John  Wiley,  1957.

Interviews  13 

Bauman,  Rogers,  and  Weiss,  1971  A-M 


Kahn,  R.  L.,  & Cannell,  C.  F.  Interviewing:  I.  Social  research,  in  Interna- 
tional Encyclopedia  of  the  Social  Sciences.  D.  L.  Sills  (Ed.),  New  York: 
Macmillan  and  Free  Press,  1968,  8,  149-161. 


Kamenetzky,  J.,  Burgess,  G.  G.,  & Rowan,  T.  The  relative  effectiveness 
of  four  attitude  assessment  techniques  in  predicting  a  criterion.  Educational
and  Psychological  Measurement,  1956,  16,  187-197.

Rating  Scales,  Projective  Items  2 

Psych.  Abst.  , #5101  A-M 


Kamfer,  L.  The  utility  of  a  buddy  rating  procedure  as  opposed  to  a  sociometric
test  for  the  identification  of  military  leaders.  Psychol.  Afr.,  1962,
9,  37-43.

Rating  Scales  1 

Psych.  Abst.  , 2Z>  ^1^5718  A-M 


* Kane,  R.  B.  Reducing  proximity  error  in  administering  the  semantic  differential
(ERIC  Document  Reproduction  Service,  ED  032  761).  Lafayette,  Indiana:
Purdue  University,  1968.  Contract  OEC-0-070189-2508.

Instrument  Format,  Semantic  Differential  Items  3c,  2 

ERIC  Document  Reproduction  Service,  ED  032  761  A-H 


Kane,  R.  B.  Computer  generation  of  semantic  differential  (SD)  question- 
naires. Educational  and  Psychological  Measurement,  1969,  ^(1),  191-192. 




Kane,  R.  B.  Minimizing  order  effects  in  the  semantic  differential. 
Educational  and  Psychological  Measurement,  1971,  137-144.

Semantic  Differential  Items,  Response  Alternatives,  3b,  3c 

Instrument  Format 

ORA  R-H 


Kapoor,  S.  D.  A comparative  study  of  the  personality  questionnaire  items 
presented  in  the  first  and  second  person.  Manas,  1963,  10,  35-44.

Question  Stem,  Investigator  Error,  Personality  3g 

Measures 

Psych.  Abst.  , #8496  A-H 


* Karr,  C.  A comparison  of  EPPS  scores  obtained  from  the  standard  forced- 
choice  procedure  and  a rating-scale  procedure.  Dissertation  Abstracts, 
1959(a),  19,  3382-3383. 


* Karr,  C.  Two  methods  for  scoring  self-rating  scales  to  approximate  forced- 
choice  results.  Psychological  Reports,  1959(b),  5,  773-779.

Scoring  8 

Psychological  Reports,  5,  p.  778  R-M 


Kashiwagi,  S.  Psychological  rating  of  human  fatigue.  Ergonomics,  1971,
14(1),  17-21. 


* Kassarjian,  H.  H. , & Nakanishi,  M.  Study  of  selected  opinion  measurement 
techniques.  Journal  of  Marketing  Research,  1967,  4,  148-153. 

Check  List,  Ranking,  Paired  Comparison  Items,  2,  3a 

Response  Alternatives,  Rating  Scales 

ORA  R-H 




Katona,  G.,  Mandell,  L.,  &  Schmiedeskamp,  J.  W.  Questionnaire  -  Chapter
15.  In  Survey  Research  Center  (Ed.),  1970  Survey  of  consumer  finance.
Ann  Arbor,  Mich.:  Michigan  University,  Survey  Research  Center,  Institute
for  Social  Research,  1971.


Katz,  D.  Do  interviewers  bias  poll  results?  Public  Opinion  Quarterly, 
1944  (a),  8,  468-482. 


Katz,  D.  The  measurement  of  intensity.  In  H.  Cantril  et  al.,  Gauging
public  opinion.  Princeton,  N.  J.:  Princeton  University  Press,  1944  (b).
Pp.  51-65.


Katz,  D.  The  interpretation  of  survey  findings.  Journal  of  Social  Issues, 
1946,  2,  33-44.


Katz,  D.,  Cartwright,  D.,  Eldersveld,  S.  J.,  & Lee,  A.  M.  (Eds.)  Public 
opinion  and  propaganda  - A book  of  readings.  New  York:  Holt,  Rinehart  and 
Winston,  1954. 


Katz,  D.  B.,  et  al.  Guide  to  special  issues  and  indexes  to  periodicals.
Special  Libraries  Association,  1962. 


Kauffman,  R.  E.  The  open-ended  and  closed  question:  Some  basic  considera- 
tions. New  Scholar,  1970,  101-118. 


Kawash,  M.  B.,  &  Aleamoni,  L.  M.  Effect  of  personal  signature  on  the
initial  rate  of  return  of  a  mailed  questionnaire.  Journal  of  Applied  Psychology,
1971,  55(6),  589-592.


Kay,  H.  A new  approach  to  projective  testing  in  survey  research.  Public 
Opinion  Quarterly,  1955,  19(3),  267-278.

Keating,  E.,  Paterson,  D.  G.,  & Stone,  C.  H.  Validity  of  work  histories 
obtained  by  interview.  Journal  of  Applied  Psychology,  1950,  34,  6-11. 




Keats,  J.  A.  A  method  of  treating  individual  differences  in  multi-dimensional
scaling.  British  Journal  of  Statistical  Psychology,  1964,  17,  37-50.


Data  Analysis 

British  Journal  of  Statistical  Psychology,  17,  p.  37


Keats,  J.  A.  Test  theory.  Annual  Review  of  Psychology,  1967,  18,  217-238.


Keil,  W.  Investigations  on  successive  item  reversions  in  self-descriptive 
statements.  Archiv  fur  Psychologie,  1971,  123(3-4) , 285-323. 


Kelley,  H.  H.,  Hovland,  C.  I.,  Schwartz,  M.,  &  Abelson,  R.  P.  The  influence
of  judge's  attitudes  in  three  methods  of  scaling.  Journal  of  Social
Psychology,  1955,  147-158.

Attitude  Measures,  Paired  Comparison  Items,  Rating  10,  8,  2 

Scales,  Scaling 


Kelley,  R.  F.,  & Stephenson,  R.  The  semantic  differential:  An  information 
source  for  designing  patronage  appeals.  Journal  of  Marketing,  1967,  31,
43-47. 

Semantic  Differential  Items  17 


Kelley,  T.  L.  The  scoring  of  alternative  responses  with  reference  to  some 
criterion.  Journal  of  Educational  Psychology,  1934,  504-510. 


Kelley,  T.  L.  Development  of  an  activity  preference  test.  USAF  Personnel
Training  Research  Center,  1957.  Report  No.  57-107.

Military  Personnel,  Preference  Measures,  Reliability  18

Psych.  Abst.,  3^,  #9312  A-N


Kellogg,  W.  N.  The  time  of  judgment  in  psychometric  measures.  American 
Journal  of  Psychology,  1931,  fQ,  65-86. 


Kelly,  E.  L.,  Miles,  C.  C.,  & Terman,  L.  M.  Ability  to  influence  one's 
score  on  a typical  pencil  and  paper  test  of  personality.  Character  and 
Personality,  1936,  4,  206-215.


Kendall,  L.  M.  An  investigation  of  hypotheses  regarding  the  way  adjustment 
of  time  limits  affects  validity.  Paper  read  at  the  33rd  meeting  of  the 
Eastern  Psychological  Association,  Atlantic  City,  1962. 


N/A  N/A

N/A  T-H

Kendall,  L.  M.  The  effects  of  varying  time  limits  on  test  validity. 
Educational  and  Psychological  Measurement,  1964,  24,  789-800.

Multi-Choice  Theory,  Military  Personnel,  7 

Investigator  Error,  Achievement  Measures 

Psych.  Abst.,  39,  #7729  A-N


Kendall,  M.  G.  Rank  correlation  methods.  London:  Griffin,  1948. 


Kendall,  M.  G.,  & Smith,  B.  B.  The  problem  of  n rankings.  Annals  of 
Mathematical  Statistics,  1939,  10,  275-287.


Kendall,  P.  Conflict  and  mood:  Factors  affecting  stability  of  response.
Glencoe,  Ill.:  Free  Press,  1954.

Response  Bias,  Reliability,  Investigator  Error,  12,  3g,  11,  10,  4

Response  Alternatives,  Respondent's  Motivation,  Clarity

Psych.  Abst.,  29,  #2101  A-H


Kentucky  University,  Home  Economics  Education.  Listing  of  evaluative  and 
other  types  of  instruments.  Lexington,  Kentucky:  Kentucky  University,
Home  Economics  Education,  1962. 


Keohane,  J.  Methods  for  surveying  employee  attitudes.  Occupational  Psychology,
1971,  217-231.


Kephart,  W.,  &  Bressler,  M.  Increasing  the  responses  to  mail  questionnaires:
A  research  study.  Public  Opinion  Quarterly,  1958,  22(2),  123-132.


Kinard,  A.  M.  Randomizing  error  in  multiple-choice  questions.  Journal  of
Marketing,  1955,  1^,  260-262.

Multiple  Choice  Items  7,  17,  3c

Bureau  of  Census  R-N


King,  F.  W.  Anonymous  versus  identifiable  questionnaire  in  drug  usage
surveys.  American  Psychologist,  1970,  25,  982-985.


King,  J.  E.  Multiple-item  approach  to  merit  rating.  American  Psychologist,
1949,  4,  278.


King,  M.  B.,  Jr.  Reliability  of  the  idea  centered  question  in  interview 
schedules.  American  Sociological  Review,  1944,  9,  57-64. 


Kingsbury,  F.  A.  Analyzing  ratings  and  training  raters.  Journal  of 
Personnel  Research,  1923,  1^,  377-383. 


Kinsey,  A.  C.,  Pomeroy,  W.  B.,  &  Martin,  C.  E.  Sexual  behavior  in  the
human  male.  Philadelphia  and  London:  W.  B.  Saunders,  1948. 




Kirby,  D. , & Gardner,  R.  Norms  on  208  words  typically  used  in  the  assess- 
ment of  ethnic  stereotypes.  University  of  Western  Ontario  Research  Bulletin 
177,  1971. 


Kirkpatrick,  C.  Assumptions  and  methods  in  attitude  measurement. 
American  Sociological  Review,  1936,  1,  75-88.


Kish,  L.  Survey  sampling.  New  York:  John  Wiley,  1965.


Klein,  E.  B.  Stylistic  components  of  response  as  related  to  attitude
change.  Journal  of  Personality,  1963,  31,  38-51.


* Klein,  S.  M.,  Maher,  J.  R.,  &  Dunnington,  R.  A.  Differences  between  identified
and  anonymous  subjects  in  responding  to  an  industrial  opinion  survey.
Journal  of  Applied  Psychology,  1967,  51,  152-160.

Response  Bias,  Respondent's  Motivation,  Anonymous  11,  12,  3g 

Respondent,  Investigator  Error,  Question  Stem 

Journal  of  Applied  Psychology,  51,  p.  152  R-H


Knoell,  D.  M.,  & DeGaugh,  R.  A.  A scaling  technique  designed  to  give 
approximations  to  factor  scales.  USAF  Personnel  Training  Research  Center, 
1957.  Research  Report  No.  57-21.


Knower,  F.  H.  An  inventory  of  public  opinion  pollers'  interviewing  problems.
International  Journal  of  Opinion  and  Attitude  Research,  1951,  221-228. 

Interviews,  Investigator  Error,  Validity  Reliability 

Psych.  Abst. , #5492 

Knowles,  J.  B.  Acquiescence  response  set  and  the  questionnaire  measure- 
ment of  personality.  British  Journal  of  Social  and  Clinical  Psychology, 
1963,  2,  131-137.

Response  Bias,  Rating  Scales,  True-False  Items  12,  2 

Psych.  Abst.,  38,  #4306  A-H


Knudsen,  D.  D.,  Pope,  H.,  &  Irish,  D.  P.  Response  differences  to  questions
on  sexual  standards:  An  interview - questionnaire  comparison.  Public
Opinion  Quarterly,  1967,  31,  290-297.

Interviews,  Anonymous  Respondent  1,  11 

Potter,  Sharpe,  Hendee  and  Clark,  1972  R-H 


Koehler,  R.  A.  A comparison  of  the  validities  of  conventional  choice 
testing  and  various  confidence  marking  procedures.  Journal  of  Education- 
al Measurement,  1971,  8(4),  297-303. 


Koehler,  R.  A.  Coombs'  type  response  procedures  (ERIC  Document  Reproduction
Service,  ED  063  338).  Paper  presented  at  the  Annual  Meeting  of  the
American  Educational  Research  Association,  Chicago,  1972.


Kogan,  W.  S.,  &  Fordyce,  W.  E.  The  control  of  social  desirability:  A  comparison
of  three  different  Q  sorts  and  a  check  list,  all  composed  of  the
same  items.  Journal  of  Consulting  Psychology,  1962,  26(1),  26-30.


Kohan,  S.,  deMille,  R.,  & Myers,  J.  Two  comparisons  of  attitude  measures. 
Journal  of  Advertising  Research,  1972,  12,  29-34.

Close-Ended  Items,  Open-Ended  Items,  Rating  Scales  1,  2 

ORA  R-M 


Komorita,  S.  S.  Attitude  content,  intensity,  and  the  neutral  point  on  a 
Likert  scale.  Journal  of  Social  Psychology,  1963,  61,  327-334.

Data  Analysis,  Scaling,  Attitude  Measures,  Scoring  2,  8 

Journal  of  Social  Psychology,  61,  pp.  327-334  R-M

(Rev.  from  rept.) 


Komorita,  S.  S.,  & Graham,  W.  K.  Number  of  scale  points  and  the  reliabil- 
ity of  scales.  Educational  and  Psychological  Measurement,  1965,  25(4), 
987-995.

Response  Alternatives,  Reliability  3a,  8

Psych.  Abst.,  40,  #3576  A-H


* Kornhauser,  A.  The  problem  of  bias  in  opinion  research.  International 
Journal  of  Opinion  and  Attitude  Research,  1947,  1(4),  1-16.


Investigator  Error,  Response  Bias  12 

ORA  R-M 


Kornhauser,  A.  Constructing  questionnaires  and  interview  schedules.  In 
Jahoda,  M.,  Deutsch,  M.,  &  Cook,  S.  W.  (Eds.),  Research  methods  in  social
relations.  New  York:  Dryden  Press,  1951.  Pp.  423-462.


Kornhauser,  A.,  &  Sheatsley,  P.  B.  Questionnaire  construction  and  interview
procedure.  In  Selltiz,  C.,  Jahoda,  M.,  Deutsch,  M.,  &  Cook,  S.  W.
(Eds.),  Research  methods  in  social  relations.  New  York:  Holt,  Rinehart,
and  Winston,  1959.  Appendix  C,  pp.  546-574.


Kotula,  L.  J.,  &  Haggerty,  H.  R.  Research  on  the  selection  of  officer
candidates  and  cadets.  U.  S.  Army  Personnel  Research  Organization,  1966.
Research  Report  No.  1146.


Krause,  M.  S.  The  validity  of  ratings.  Psychological  Reports,  1960,  7,
71-79. 


Krause,  M.  S.  Role-deviant  respondent  sets  and  resulting  bias,  their  de- 
tection and  control  in  the  survey  interviews.  Journal  of  Social  Psychology, 
1965,  67(1),  163-183. 


Krause,  M.  S.  Ordinal  scale  construction  for  convergent  validity,  object 
discrimination,  and  resolving  power.  Multivariate  Behavioral  Research. 
1966,  1(3),  379-385. 


Krech,  D.,  and  Crutchfield,  S.  Theory  and  problems  of  social  psychology.
New  York:  McGraw-Hill,  1948. 




Krech,  D.,  Crutchfield,  S.,  and  Ballachey,  E.  Individual  in  society.
New  York:  McGraw-Hill,  1962.


* Kriedt,  P.  H.,  & Clark,  K.  E.  "Item  analysis"  versus  "scale  analysis." 
Journal  of  Applied  Psychology,  1949,  33,  114-121.

Scaling,  Data  Analysis,  Attitude  Measures  8,  14 

Psych.  Abst.,  26,  #23  A-N


Krieger,  M.  H.  A control  for  social  desirability  in  a semantic  differen- 
tial. British  Journal  of  Social  and  Clinical  Psychology,  1964,  2(2),  94-103. 

Response  Bias,  Semantic  Differential  Items,  12 

Scaling 

Psych.  Abst.  , 2£,  #7600  A-M 


Kroeger,  H.  J.  The  usefulness  of  the  multiple-choice  question.  International
Journal  of  Opinion  and  Attitude  Research,  1947,  1(1),  102-105.


Krueger,  W.  C.  F.  An  experimental  study  of  certain  phases  of  a true-false 
test.  Journal  of  Educational  Psychology,  1932,  23,  81-91.


Krug,  R.  E.  A  selection  set  preference  index.  Journal  of  Applied  Psychology,
1958,  42,  168-170.

Adjectives,  Response  Bias  6,  12

ORA  R-H


Krug,  R.  E.,  & Moyer,  K.  E.  An  analysis  of  the  F scale:  II,  Relationship 
to  standardized  personality  inventories.  Journal  of  Social  Psychology,
1961,  22,  293-301. 




* Krug,  R.  E.,  & Northrup,  D.  Judgment  time  for  forced-choice  adjective 
pairs.  Journal  of  Applied  Psychology,  1959,  43(6),  407-410.

Forced  Choice  Items,  Adjectives,  Personality  Measures  6 

ORA  R-H 


Kuang,  H.  P.  A critical  evaluation  of  the  relative  efficiency  of  three 
techniques  in  item  analysis.  Educational  and  Psychological  Measurement, 
1952,  12,  248-266. 


* Kundu,  G.  A new  technique  for  the  measurement  of  attitude.  Psychological 
Studies , Mysore , 1960,  ^(2),  106-117. 


* Kundu,  G.  The  rationale  for  a  new  technique  of  attitude  measurement.
Indian  Journal  of  Psychology,  1962,  37(3)  , 147-151. 

Scaling,  Response  Bias  12,  14 

Psych.  Abst.  , #10057  A-M 

Kundu,  G.  A comparison  of  the  Likert  and  a new  technique  of  attitude 
measurement.  Indian  Journal  of  Psychology.  1972  (a),  ^(3),  245-258. 

NA  NA 

NA  T-H 


Kundu,  G.  A  new  technique  of  attitude  scale  construction.  Behaviorometric,
1972  (b),  2(1),  13-20.

Kurt,  B.  Combining  responses  on  two  forms  of  a questionnaire  with  options 
in  inverse  order.  Public  Opinion  Quarterly,  1949-50,  13(4),  688-689.

Multiple  Choice  Items,  Data  Analysis,  Investigator  Error  8,  2

ORA  R-N




Kuusinen,  J.  Affective  and  denotative  structures  of  personality  ratings. 
Journal  of  Personality  and  Social  Psychology.  1969,  1^(3),  181-188. 


La  Fave,  L.,  &  Sherif,  M.  Reference  scales  and  placement  of  items  with
the  own  categories  technique.  Paper  presented  to  American  Psychological 
Association,  Annual  Meetings,  St.  Louis,  1962.  Norman,  Oklahoma:  Institute 
of  Group  Relations,  mimeographed. 


N/A  N/A

N/A  T-H

* Landon,  E.  L.,  Jr.  Order  bias,  the  ideal  rating,  and  the  semantic  differential.
Journal  of  Marketing  Research,  1971,  8,  375-378.

Instrument  Format,  Semantic  Differential  Items  3c 

ORA  R-H 


Lanfeld,  E.  S.,  & Saunders,  D.  R.  Some  determiners  of  performance  on  an 
experimental  sentence  arrangement  test.  Journal  of  Clinical  Psychology,
1961,  17,  238-41. 


Langer,  P.  Social  desirability  and  acquiescence  on  the  SORT.  Psychologi- 
cal Reports,  1962,  JA(2)  , 531-534. 


Lanman,  R.  W.,  &  Remmers,  H.  H.  The  "preference"  and  "discrimination"  indices
in  forced-choice  scales.  Educational  and  Psychological  Measurement,
1954,  14,  541-551.


Lansing,  J.  B.,  & Eapen,  A.  T.  Dealing  with  missing  information  in  survey 
Journal  of  Marketing,  1959,  22-28. 

Data  Analysis  18 

ORA  A-NA 




Lansing,  J.  B.,  Ginsberg,  G.  P.,  &  Braaten,  K.  An  investigation  of  response
error.  Urbana,  Ill.:  University  of  Illinois  Press,  1961.


Lantz,  D.  L.  The  effect  of  reinforcement  of  statements  of  pupil-teacher
relationships  in  free  response  situation  on  Minnesota  teacher  attitude  in- 
ventory scores.  Research  Bulletin  67-28.  Princeton,  New  Jersey:  Education- 
al Testing  Service,  1967. 


N/A  N/A

N/A  T-M

LaPiere,  R.  T.  Attitudes  versus  actions.  Social  Forces,  1934,  13,  230-237.

Question  Stem  3g

Potter,  Sharpe,  Hendee  and  Clark,  1972  \-N 


Lapointe,  R.  E.,  &  Auclair,  G.  A.  The  use  of  social  desirability  in  forced-choice
methodology.  American  Psychologist,  1961,  16,  446.  (Abstract)


* Laurent,  A.  Effects  of  question  length  on  reporting  behavior  in  the  survey 
interview.  Journal  of  the  American  Statistical  Association,  1972,  67(338),
298-305. 


Question  Stem,  Interviews  3g

ORA  R-H


Laurent,  A.  VI.  Question  length  and  reporting  behavior  in  the  interview 
preliminary  investigation.  Ann  Arbor,  Mich.:  Survey  Research  Center, 
Institute  for  Social  Research,  n.d. 


Question  Stem,  Interviews  3g,  5

ORA  R-H


Lazarsfeld,  P.  F.  The  art  of  asking  'Why?'.  Three  principles  underlying 
the  formulation  of  questionnaires.  National  Marketing  Review,  1935,  1(1),
26-38. 




Lazarsfeld,  P.  F.  The  controversy  over  detailed  interviews:  An  offer  for
negotiation.  Public  Opinion  Quarterly,  1944,  8,  38-60.


Lazarsfeld,  P.  F.,  &  Barton,  A.  Some  general  principles  of  questionnaire
classification.  In  Lazarsfeld,  P.  F.,  &  Rosenberg,  M.  (Eds.),  The  Language
of  Social  Research.  Glencoe,  Ill.:  The  Free  Press,  1963.  Pp.  83-92.


Lazarsfeld,  P.  F.,  &  Fiske,  M.  The  "panel"  as  a  new  tool  for  measuring
opinion.  Public  Opinion  Quarterly,  1938,  2,  596-613. 


Lazarsfeld,  P.  F.,  &  Henry,  N.  W.  Latent  structure  analysis.  Boston:
Houghton  Mifflin,  1968. 


Leeznar,  W.  B.  Comparison  of  test  items  across  forms.  USAF  PRL  TDR  No . 


Instrument  Length,  Achievement  Measures,  8 

Reliability 

Psych.  Abst.  , #7774  A-N 




* Le Herman,  E.  Effect  of  the  response  format  on  the  differential  measure- 
ment of  traits  in  the  Thorndike  Dimensions  of  Personality  Inventory  (ERIC 
Document  Reproduction  Service,  ED  052  249).  1971.

Forced  Choice  Items,  Response  Bias  2,  12 

ERIC  Reproduction  Service,  ED  052  249  A-M 


Ledvinka,  J.  Race  of  interviewer  and  the  language  elaboration  of  black
interviewees.  Journal  of  Social  Issues,  1971,  27(4),  185-197.


N/A  N/A 




Lee,  J.  M.,  &  Symonds,  P.  M.  New  type  of  objective  tests:  A  summary  of
investigations.  Journal  of  Educational  Psychology,  1934,  25,  161-184.


* Leftwich,  W.  H.,  &  Remmers,  H.  H.  A  comparison  of  graphic  and  forced-choice
ratings  of  teaching  performance  at  the  college  and  university  level.
Studies  in  Higher  Education,  1962,  No.  92.


Response  Bias,  Forced-Choice  Items,  Rating  Scales  2,  12

Psych.  Abst.,  40,  #7071  A-H


Lehman,  R.  S.  Length  of  reference  line  in  similarity  ratings.  Perceptual
and  Motor  Skills,  1967,  ^(1),  216.


Rating  Scales,  Instrument  Format  3g

ORA  R-H


Lehmann,  D.  R.,  &  Hulbert,  J.  Are  three-point  scales  always  good  enough?
Journal  of  Marketing  Research,  1972,  9,  444-446.

Instrument  Format,  Response  Alternatives  3a

ORA  R-M


Lehnene,  R.  G.  Assessing  reliability  in  sample  surveys.  Public  Opinion
Quarterly,  1971-1972,  35(4),  578-592.


Lentz,  T.  F.  Reliability  of  opinionnaire  technique  studied  intensively
by  the  retest  method.  Journal  of  Social  Psychology,  1934,  5,  338-364.


Lentz,  T.  F.  Acquiescence  as  a  factor  in  the  measurement  of  personality.
Psychological  Bulletin,  1938,  35,  659.


Leslie,  L.  L.  Increasing  response  rates  to  long  questionnaires.  Journal
of  Educational  Research,  1970,  63(8),  347-350.


Instrument  Length,  Respondent's  Motivation  5,  11

ORA  R-M


Levine,  D.  B.,  &  Miller,  H.  P.  Response  variation  encountered  with  different
questionnaire  forms  -  an  experimental  study  of  selected  techniques
used  in  agriculture  marketing  research.  Washington,  D.  C.:  United  States
Government  Printing  Office,  1957.  Marketing  Research  Report  No.  163.


N/A  N/A

N/A  T-H

Levine,  R.  S.  A study  of  a new  type  of  answer  sheet.  Statistical  Report 
56-23.  Princeton,  New  Jersey:  Educational  Testing  Service,  1956.


N/A  N/A

N/A  T-H


Levonian,  G.  Reliability  of  personality  measurement  by  interview  survey 
methods.  Psychological  Reports,  1963,  13(2),  467-474.


Personality  Measures,  Interviews,  Reliability,
Rating  Scales

Psych.  Abst.,  38,  #8460  A-M


Levy-Leboyer,  C.  La  signification  des  omissions  dans  quatre  tests  collectifs:
etude  experimentale.  (The  significance  of  omissions  in  four  group
tests.)  Travail  Humaine,  1955  (a),  18,  315-321.


Levy-Leboyer,  C.  Les  omissions  dans  les  tests  collectifs.  (Omissions  in
group  tests.)  Travail  Humaine,  1955  (b),  96-108.

Response  Bias,  Respondent's  Motivation  11,  12 

Psych.  Abst.,  #91  A-M


Lewin,  K.  Principles  of  topological  psychology.  New  York:  McGraw-Hill,
1936.


Lewis,  H.  B.  Studies  in  the  principles  of  judgments  and  attitudes:  IV.
The  operation  of  "prestige  suggestion."  Journal  of  Social  Psychology,  1941,
14,  229-256.


Lewis,  L.  H.  Acquiescence  response  set:  Construct  or  artifact.  Journal  of 
Projective  Techniques,  1968,  578-584. 


Liberty,  P.  G.,  Jr.  Methodological  considerations  in  the  assessment  of 
acquiescence  in  the  MA  and  SD  scale.  Journal  of  Consulting  Psychology, 
1965,  29(1),  37-42.

Response  Bias,  Question  Stem,  Personality  Measures  12,  3g 

Psych.  Abst.  , 2^,  A-M 


Liberty,  P.  G.,  Lunneborg,  C.  E.,  &  Atkinson,  G.  C.  Perceptual  defense,
dissimulation,  and  response  styles.  Journal  of  Consulting  Psychology,
1964,  28,  529-537.


Lichenstein,  E.  R.,  Quinn,  P.,  &  Hover,  G.  L.  Dogmatism  and  acquiescent
response  set.  Journal  of  Abnormal  and  Social  Psychology,  1961,  63,  636-638.


Lieberman,  L.  R.,  &  Walters,  S.  M.  The  one-year-to-live  situation:  Essay
vs.  inventory.  Journal  of  Clinical  Psychology,  1972,  28(2),  205-209.


Likert,  R.  A  technique  for  the  measurement  of  attitudes.  Archives  of
Psychology,  1932,  22(140),  1-55.

Scaling  17 


Likert,  R.,  Roslow,  S.,  & Murphy,  G.  A simple  and  reliable  method  of 
scoring  the  Thurstone  attitude  scales.  Journal  of  Social  Psychology, 
1934,  5,  228-238. 


Lilly,  R.  S.  A  developmental  study  of  the  semantic  differential.
Research  Bulletin  65-28.  Princeton,  New  Jersey:  Educational  Testing 
Service,  1965. 


Lindgren,  H.  C.  The  incomplete  sentences  test  as  a means  of  course  eval- 
uation. Educational  and  Psychological  Measurement,  1952,  12,  217-225. 


Lindquist,  E.  F.  Factors  determining  reliability  of  test  forms.  Journal
of  Educational  Psychology,  1930,  512-520.


Lindzey,  G.  A note  on  interviewer  bias.  Journal  of  Applied  Psychology, 
1951,  35(3) , 182-184. 

Interviews,  Investigator  Error  12 

Psych.  Abst.  , 2^,  #832  R-M 


Lindzey,  G.  Experiments:  Their  planning  and  execution.  In  Lindzey,  G. 
(Ed.),  Handbook  of  social  psychology.  Cambridge,  Mass.:  Addison-Wesley 
Publishing  Company,  1954. 




Lindzey,  G.,  &  Borgatta,  E.  F.  Sociometric  measurement.  In  Lindzey,  G.
(Ed.),  Handbook  of  social  psychology,  Vol.  1.  Reading,  Mass.:  Addison-Wesley,
1954.  Chapter  11.


Lindzey,  G.  E.,  &  Guest,  L.  To  repeat - check  lists  can  be  dangerous.
Public  Opinion  Quarterly,  1951,  15,  355-358.

Check  List,  Response  Bias,  Response  Alternatives  2,  8,  13,  3a

Psych.  Abst.,  #5494  A-H


Link,  H.  C.  An  experiment  in  depth  interviewing  on  the  issue  of  internationalism
vs.  isolationism.  Public  Opinion  Quarterly,  1943,  7,  267-279.


Linton,  H.  B.  Dependence  on  external  influence:  Correlates  in  perception,
attitudes,  and  judgment.  Journal  of  Abnormal  and  Social  Psychology,  1955, 
502-507. 


Litwak,  E.  A  classification  of  biased  questions.  American  Journal  of
Sociology,  1956,  62(2),  182-186.

Investigator  Error,  Clarity,  Data  Analysis  13,  4,  8

American  Journal  of  Sociology,  62,  p.  182  R-M


Lloyd,  B.  B.,  &  Innes,  J.  M.  Influence  of  past  experience  on  meaningfulness
of  concepts  on  a  semantic  differential.  Psychological  Reports,  1969,
^(1),  269-270. 


Loevinger, J. The technique of homogeneous tests compared with some
aspects of "scale analysis" and factor analysis. Psychological Bulletin,
1948, 507-529.


Loevinger, J. A theory of test response. Proceedings of the 1958
Invitational Conference on Testing Problems. Princeton, N.J.: Educational
Testing Service, 1959. Pp. 36-47.


Long,  B.  H.,  Henderson,  E.  H.,  & Ziller,  R.  C.  Self-ratings  on  the 
semantic differential: Content versus response set. Child Development,
1968,  39(2),  647-656. 


Long,  L.  A study  of  the  effect  of  preceding  stimuli  upon  the  judgment 
of  auditory  intensities.  Archives  of  Psychology,  New  York,  1937,  No.  209. 


Longstaff, H. P., & Jurgensen, C. E. Fakability of the Jurgensen
classification inventory. Journal of Applied Psychology, 1953, 37, 86-89.


Longworth, D. S. Use of a mail questionnaire. American Sociological
Review, 1953, 18, 310-313.


Lord,  F.  Testing  if  two  measuring  procedures  measure  the  same  dimension. 
Psychological Bulletin, 1973, 79, 71-72.


Lord,  F.  M.  Reliability  of  multiple-choice  tests  as  a function  of  number 
of choices per item. Journal of Educational Psychology, 1944, 35, 175-180.


Lord,  F.  M.  Do  tests  of  the  same  length  have  the  same  standard  errors  of 
measurement? Educational and Psychological Measurement, 1957, 17, 510-521.

Data  Analysis,  Scoring  S 

Psych.  Abst.,  13,  #4704  A-N 


Lord, F. M. The self-scoring flexilevel test (ERIC Document Reproduction
Service, ED 042 813). Princeton, New Jersey: Educational Testing Service,
1970. Report No. RB-70-43.

Achievement  Measures 

ERIC  Document  Reproduction  Service,  ED  042  813  A-M 




Lorge, I. Gen-Like: Halo or reality. Psychological Bulletin, 1937, 34,
545-546. 

Response  Bias,  Rating  Scales,  Validity,  Investigator  Error  12 
Psychological  Bulletin,  34 , 545-546  (Rev.)  A-N 


Loveless, E. J. Artistic preferences, conceptual thinking and intellectual
attitudes. Final report. Notre Dame, Indiana: Notre Dame University, 1970.


Lozar, C. C. Measuring techniques to determine architectural satisfaction
of  human  needs  (DAOG8136).  Champaign,  Illinois:  OCE  Construction  Engineering 
Research  Laboratory,  1973. 


Lu, K. H. A measure of agreement among subjective judgments. Educational
and Psychological Measurement, 1971, 75-84.

Luchins, A. S., & Luchins, E. H. On conformity with true and false
communications. Journal of Social Psychology, 1955 (a), 283-303.

Investigator  Error  7 

Psych.  Abst. , 31 , #974  A-H 


Luchins, A. S., & Luchins, E. H. Previous experience with ambiguous and
non-ambiguous perceptual stimuli under various social influences. Journal
of Social Psychology, 1955 (b), 249-270.

Investigator  Error,  Instrument  Format,  Projective  Items  3g,  13,  7 

Psych. Abst., 31, #975 (Rev. from rept.)  R-H


Lucky, A. W., & Grigg, A. E. Repression-sensitization as a variable in
deviant  responding.  Journal  of  Clinical  Psychology,  1964,  ^(1),  92-93. 

Personality Measures, Response Bias  10, 12

Psych. Abst., 32, #10058  A-M


Lundberg, C. C. A transactional conception of fieldwork. Human
Organization, 1968, 27(1), 45-49.


Lundberg, G. A. Social research. New York: Longmans, Green, 1929.
Chapters 6 and 9.


* Lusk, E. J. A bipolar adjective screening methodology. Journal of
Marketing  Research,  1973,  9^,  202-203. 


Semantic Differential Items, Response Alternatives  14

ORA  R-M


Lyman, H. B. A comparison of the use of scrambled and blocked items in a
multi-scale  school  attitude  inventory.  Journal  of  Educational  Research, 
1949,  43(4) , 287-2 


Attitude  Measures,  Instrument  Format 


3c,  8 


1 

1 


ORA 


9 


R-H 


Lyman,  H.  B.  Test  scores  and  what  they  mean.  New  Jersey:  Prentice-Hall, 
1963. 

Lynch, D. O., & Smith, B. C. To change or not to change item responses
when taking tests: Empirical evidence for test takers (ERIC Document
Reproduction Service, ED 064 324). Paper presented to the annual meeting of the
American Education Research Association, Chicago, 1972.

Achievement  Items  18 

ERIC Document Reproduction Service, ED 064 324  A-M



Macoby, E. A., & Macoby, N. The interview: a tool of social science.
Handbook of social psychology I: Theory and method. Reading, Mass.:
Addison-Wesley, 1954.




Madden, J. M. Familiarity effects in evaluative judgments. USAF WADD
Personnel Laboratory, 1960 (a). Technical Note No. 60-261.

N/A  N/A

N/A  T-H

Madden, J. M. A review of some literature on judgment with implications
for job evaluation. USAF WADD Personnel Laboratory, 1960 (b). Technical
Note No. 60-212.

N/A  N/A 

N/A  T-M 

Madden,  J.  M.  A comparison  of  three  methods  of  rating  scale  construction. 
Journal  of  Industrial  Psychology,  1964,  ^(2),  43-50. 

Rating Scales, Response Alternatives, Military Personnel, Reliability  3f, 14

Psych. Abst., 35, #7168 (Rev.)  A-H


Madden, J. M., & Bourdon, R. D. Effects on judgment of variations in
rating  scale  format.  USAF  PRL  Technical  Document  Report  No.  63-2,  1963. 

Rating Scales, Military Personnel, Instrument Format, Response Alternatives  3b, 3c

Psych. Abst., #8302  R-H


Madden,  J.  M.,  & Bourdon,  R.  D.  Effects  of  variations  in  rating  scale 
format  on  judgment.  Journal  of  Applied  Psychology,  1964,  147-151. 

Instrument  Format,  Rating  Scales  3a 

Journal of Applied Psychology, 48, p. 151  R-NA




Madow, W. G. On some aspects of response error measurement. Proceedings
of  the  Social  Statistics  Section,  American  Statistical  Association,  1965, 
182-192. 

Investigator  Error,  Interviews  13 

Bureau  of  Applied  Social  Research  A-H 


Magill,  W.  H.  The  influence  of  the  form  of  item  on  the  validity  of 
achievement  tests.  Journal  of  Educational  Psychology,  1934,  23,  21-28. 


Magnusson, D., & Ekehammar, B. Subjective confidence and interjudge
agreement  as  functions  of  amount  of  information:  A study  of  interview  data 
(AD  908  101).  Stockholm,  Sweden:  Stockholm  University,  Psychological  Lab., 
1972,  Report  No.  366. 


N/A  N/A

N/A  T-M

Magnusson, D., & Ryder, B. Subjective confidence and interobserver
agreement for interview data as functions of interview length (AD-853 753).
Stockholm, Sweden: Stockholm University, Psychological Laboratories, 1967.


N/A  N/A

N/A  T-M

Magrid,  F.  N.,  et  al.  Mail  questionnaires:  Adjunct  to  the  Interview. 
Public  Opinion  Quarterly,  1962,  111-114. 


Mahler,  I.  Yeasayers  and  naysayers  - a validating  study.  Journal  of 
Abnormal  and  Social  Psychology.  1962,  ^(4),  317-318. 

Response  Bias,  Validity  12 

ORA  R-M 


Maller, J. B. The effect of signing one's name. School and Society, 1930,
31, 882-884.


Maller, J. B. Character and personality tests: A descriptive bibliography
including measures of attitudes, interest, adjustment, appreciation,
moral knowledge, behavior and rating scales. New York: Teachers College,
Columbia University, 1936.


Malmud,  R.  S.  Controlled  versus  free  completion.  American  Journal  of 
Psychology,  1925,  401-411. 

Achievement  Measures,  Response  Alternatives  2 


Malry,  R.  C.,  & Roberts,  M.  The  testing  situation  as  a positive  source 
of influence. Research Bulletin 67-35. Princeton, New Jersey: Educational
Testing Service, 1967.


Mandler,  G.  Stimulus  variables  and  subject  variables:  A caution. 
Psychological  Review,  1951,  66,  145-149. 


Manfield, M. N. A new dimension in consumer analysis--The Guttman scale.
Paper presented before the Marketing Research Discussion Group of the
American Marketing Association (New York Chapter), 1971.

Scaling, Questionnaire Theory and Development  14, 15

Abaum, et al. (Eds.) Scientific Marketing Research.
New York: Scott Foresman, 1972. (Rev. from rept.)


Manis,  J.  G.,  Brawer,  M.  J.,  Hunt,  C.  L.,  & Kercher,  L.  C.  Validating 
a mental  health  scale.  American  Sociological  Review.  1963,  ^(1),  108-116. 


Manis, M. Comment on Upshaw's "own attitude as an anchor in equal-appearing
intervals."  Journal  of  Abnormal  and  Social  Psychology.  1964,  68(6),  689-691. 




Manne,  H.  A.,  & Willing,  R.  C.  (Eds.)  Current  research  techniques  in 
military  personnel  assignment  (ERIC  Document  Reproduction  Service,  ED  049 
445).  Proceedings  of  Annual  Conference,  Military  Testing  Association,  1970. 

Questionnaire  Theory  and  Development,  Bibliography  17 

ERIC  Document  Reproduction  Service,  ED  049  445  A-H 


Manning,  P.  K.  Problems  in  interpreting  interview  data.  Sociology  and 
Social  Research,  1967,  302-316. 


Marais, H. C. Evaluation of the semantic differential as instrument for
measurement of attitudes. Psychological Reports, 1967, 21(2), 591-592.

Semantic Differential Items  2

Psychological Reports, 21(2), p. 591 (Rev. from rept.)  R-M


Maranell,  G.  M.  (Ed.)  Scaling:  A sourcebook  for  behavioral  scientists. 
Chicago, Ill.: Aldine Publishing Co., 1974.

Scaling, Bibliography  17

Aldine  Publishing  Co.  A-H 


Marcus,  A.  The  effect  of  correct  response  location  on  the  difficulty 
level  of  multiple-choice  questions.  Journal  of  Applied  Psychology,  1963, 
^(1),  48-51. 

Multiple Choice Items, Response Bias, Achievement Measures, Response Alternatives  12, 3e

Psych. Abst., 37, #7978  A-M


Margolis, C., & Porter, C. R. The relationship of message unity ('pull')
to  the  recipient's  response  potential.  Journal  of  Applied  Psychology,  1959, 
^(6),  367-371. 


Markey,  S.  C.,  Winer,  B.  J.,  & Falk,  G.  H.  Reliability  of  ratings  for 
high-ranking officers. American Psychologist, 1949, 4, 300.


Marks, E. S., & Mauldin, W. P. Problems of response in enumerative surveys.
American Sociological Review, 1950, 15, 649-657.

Response  Bias,  Interviews,  Investigator  Error  12,  13,  17 

Psych. Abst., 26, #4739 (Rev. from rept.)  R-N


Marlett, S. A. A comparison of vicarious and direct reinforcement control
of  verbal  behavior  in  an  interview  setting.  Journal  of  Personality  and 
Social  Psychology,  1970,  J^(4) , 695-703. 


Marley,  A.  A.  Some  probabilistic  models  of  simple  choice  and  ranking. 
Journal  of  Mathematical  Psychology,  1968,  311-332. 


Marquis, K. H. An experimental study of the effects of reinforcement,
question  length,  and  reinterviews  on  reporting  selected  chronic  conditions 
in  household  interviews.  Ann  Arbor,  Mich.:  Survey  Research  Center, 
University  of  Michigan,  1969. 


Marquis,  K.  H.  Effects  of  social  reinforcement  on  health  reporting  in 
the household interviews. Sociometry, 1970 (a), 33, 203-215.


Marquis,  K.  H.  The  use  of  verbal  reinforcement  in  personal  interviews  and 
its  relationship  to  data  accuracy  - Section  Four.  In  Cannell,  C.  F.,  Marquis, 
K.  H.,  & Laurent,  A.,  Studies  of  interviewing  methodology-A  summary  of 
research  conducted  for  the  National  Center  for  Health  Statistics.  Ann  Arbor, 
Michigan:  Univ.  of  Michigan,  Survey  Research  Center,  1970(b).  PH-43-68-209. 

N/A  N/A

N/A  T-H




Marquis,  K.  H.  Effects  of  race,  residence,  and  selection  of  respondent 
on the conduct of the interview. Chapter 12. In Lansing, J. B., Withy,
S. B., Wolfe, A. C., et al., Working papers on survey research in poverty
areas. Ann Arbor, Mich.: University of Michigan, Institute for Social
Research, Survey Research Center, 1971.


Marquis, K. H., & Cannell, C. F. Effect of some experimental
interviewing techniques on reporting in health interview survey - a methodological
study  designed  to  test  the  effectiveness  of  certain  questionnaire  designs. 
U.  S.  National  Center  for  Health  Statistics,  Division  of  Health  Interview 
Statistics,  Public  Health  Service,  Publication  No.  1000,  Series  2,  No.  41. 


* Marquis, K. H., Cannell, C. F., & Laurent, A. Reporting health events in
household interviews: Effects of reinforcement, question length, and
reinterviews. Ann Arbor, Mich.: Univ. of Michigan, Institute for Social
Research, Survey Research Center Vital and Health Statistics, Series 2,
No. 45, 1972.

Instrument Format, Interviews, Respondents' Motivation, Question Stem  3g, 11

Psych. Abst., 48, #09206  A-H


Marquis,  K.  H.,  Marshall,  J.,  & Oskamp,  S.  Testimony  validity  as  a 
function  of  question  form,  atmosphere  and  item  difficulty.  Journal  of 
Applied  Social  Psychology,  1972,  2(2),  167-186. 


N/A  N/A

N/A  T-H

* Marquis, K. H., Marshall, J., & Oskamp, S. Accuracy and completeness of
testimony  as  a function  of  kind  of  question,  interrogation  atmosphere,  and 
item  content.  Ann  Arbor,  Michigan:  University  of  Michigan,  Survey  Research 
Center,  n.d. 

Clarity, Interviews, Respondent's Motivation, Question Stem, Investigator Error  3g, 4, 11, 7

Bureau of Applied Social Research (Rev.)  A-H




Marsh,  C.  J.  The  influence  of  supplementary  verbal  directions  upon 
results obtained with questionnaires. Journal of Social Psychology, 1945,
21(2),  275-281. 


N/A  N/A

N/A  T-H

Marsh,  S.  E.,  & Perrin,  F.  A.  C.  An  experimental  study  of  the  rating 
scale  technique.  Journal  of  Abnormal  and  Social  Psychology,  1925,  19, 
383-399. 

Rating  Scales  2,  3f 
ORA  R-H 


Marshall,  J.,  Marquis,  K.  H.,  & Oskamp,  S.  Effects  of  kind  of  question 
and  atmosphere  of  interrogation  on  accuracy  and  completeness  of  testimony. 
Harvard Law Review, 1971, 84(7), 1620-1643.


Marso,  R.  N.  Test  item  arrangement,  testing  time,  and  performance. 
Journal of Educational Measurement, 1970, 7, 113-118.


Mascaro,  G.  F.  Category-width  tendencies  and  acceptance  of  attitude 
statements. Perceptual and Motor Skills, 1968, 27(2), 410.

Response  Bias  12 

ORA  R-M 


Masling, J. The effects of warm and cold interaction on the
administration and scoring of an intelligence test. Journal of Consulting
Psychology, 1959, 23, 336-341.

Investigator Error  7

Psych. Abst., 34, #4395  A-M


Mason,  J.,  & Smith,  E.  M.  The  influence  of  instructions  on  respondent 
error. Journal of Marketing Research, 1970, 7, 254-5.


Massarik, F., Weschler, I. R., & Tannenbaum, R. Evaluating efficiency
rating  systems  through  experiment.  Personnel  Administration.  1951  lAfll 
42-47.  ~ 


Raters,  Rating  Scales,  Investigator  Error 
Psych.  Abst. , M , #1610 


2,  7,  11 


MASSTER, C&A Branch, P&E Division, Directorate, CHARLIE (ACCB). Questionnaire:
ACCB Handbook of Design. C&A Br., P&E Division, Directorate, CHARLIE (ACCB),
Fort Hood, Texas, 1971.

Textbook  17 


MASSTER,  OSD-6  P&E  Division,  Directorate  ALPHA.  Questionnaire:  OSD-6 
Handbook  of  Design.  Fort  Hood,  Texas:  OSD-6  P&E  Division,  Directorate 
ALPHA,  1972. 

Textbook  17 


* Masters, J. R. Reliability as a function of the number of categories of
a summated rating scale. Dissertation Abstracts International, 1973,
33(8-A), 4180-4181.

Reliability, Response Alternatives, Rating Scales

Dissertation Abstracts, 33(8-A), 4180-4181 (Rev.)


Matarazzo, J. D., & Wiens, A. N. The interview: Research on its anatomy
and  structure.  Chicago:  Aldine  Publishing  Co.,  1974. 




Matarazzo, J. D., et al. Interviewer Mm-Hmm and interviewee speech
durations. Psychotherapy: Theory, Research and Practice, 1964, 1(3), 109-114.


Investigator Error, Interviews, Respondent's Motivation  12, 11

Psych. Abst., 40, #1858  A-H


Matell,  M.  S.  The  psychometric  characteristics  of  Likert-type  rating 
scales  consisting  of  two-through  nineteen  steps.  Dissertation  Abstracts 
International , 1970,  ^(9-B),  4406. 

Scaling, Respondent's Motivation, Response Alternatives, Scoring  3a, 11, 8

ORA  R-NA


Matell,  M.  S.,  & Jacoby,  J.  Is  there  an  optimal  number  of  alternatives  for 
Likert  scale  items?  Study  I:  Reliability  and  validity.  Educational  and 
Psychological  Measurement,  1971,  ^(3),  657-674. 

Reliability, Validity, Response Alternatives, Rating Scales  3a

ORA  R-H


Matell, M. S., & Jacoby, J. Is there an optimal number of alternatives for
Likert scale items? Effects of testing time and scale properties. Journal
of Applied Psychology, 1972, 56(6), 506-509.

Attitude Measures, Response Alternatives, Rating Scales  3a

Journal of Applied Psychology, 56, p. 506 (Rev.)  R-NA


Mathews, C. O. The effect of printed response words upon children's
answers  to  two-response  types  of  tests.  Journal  of  Educational  Psychology. 
1927,  18,  445-457. 


Mathews, C. O. The effect of the order of printed response words on an
interest  questionnaire.  Journal  of  Educational  Psychology.  1929,  20,  128-134. 

Attitude Measures, Instrument Format, Response Alternatives  3b

ORA




Matthews, J. J., Wright, C. E., & Yudowitch, K. L. Analyses of the results
of the administration of three sets of descriptive adjective phrases. Palo
Alto: Operations Research Associates, 1975. (Prepared for the Army Research
Institute for the Behavioral and Social Sciences, Fort Hood, Texas under
Contract DAHC19-74-C-0032.)

Adjectives  6 

ORA  R-H


Mausner, B. The effect of instructed bias on judges in a Thurstone scale
construction. (Doctoral dissertation, University of Pittsburgh), Pittsburgh,
Penn.:  Graduate  School  of  Public  Health,  1960. 


Maxwell,  A.  E.,  & Pilliner,  A.  E.  Deriving  coefficients  of  reliability  and 
agreement  for  ratings.  British  Journal  of  Mathematical  and  Statistical 
Psychology, 1968, 21(1), 105-116.


May,  M.  A.,  & Hartshorn,  H.  First  steps  toward  a scale  for  measuring 
attitudes. Journal of Educational Psychology, 1926, 17, 145-162.


Mayer, C. S., & Pratt, R. W., Jr. A note on nonresponse in a mail survey.
Public  Opinion  Quarterly,  1966-67,  637-646. 


Maynes, E. S. The anatomy of response errors: Consumer saving. Journal of
Marketing  Research.  1965,  2(4),  378-387. 


Mayo, G. D. Peer ratings and halo. Educational and Psychological
Measurement, 1956, 16, 317-323.

Raters  12 

Psych. Abst., 32, #2783  A-H


Mays,  R.  J.  Relationships  between  length  of  acquaintance  and  nature  of 
trait rated and agreement between raters. AFPTRC Res. Bull., 1954, 54-55.


McClanahan,  A.  U.  A Monte  Carlo  evaluation  of  RAW,  FRED,  PROF,  and  COPAN: 
Four  techniques  for  capturing  and  clustering  rater  strategies.  Dissertation 
Abstracts  International,  1973,  33(8-B) , 3987. 


McClusky,  H.  Y.  The  negative  suggestion  effect  of  the  false  statement 
in  the  true-false  test.  Journal  of  Experimental  Education,  1934,  7^,  267-273. 


McCollough,  C.,  & Van  Atta,  L.  Introduction  to  descriptive  statistics  and 
correlation. New York: McGraw-Hill, 1965.


McConochie,  W.  A.  Comparison  of  traditional  attitude  measuring  techniques 
with  a new  technique.  Dissertation  Abstracts  International,  1970,  31(2-B) , 
901. 


McCord,  H.  Discovering  the  "confused"  respondent:  A possible  projective 
method. Public Opinion Quarterly, 1951, 15, 363-366.

Response  Bias,  Preference  Measures  2,  12 

Psych.  Abst. , #5495  A-H 


McCormick, E. J. Effect of amount of job information required on
reliability of incumbents' check-list reports. USAF WADD Technical Note No.
60-142,  1960. 

Military Personnel, Check List, Instrument Length, Reliability  5, 14

Psych.  Abst. , ^,  #7165  A-M 


McCormick, E. J., & Bachus, J. A. Paired comparisons ratings. I. The
effect of ratings of reductions in the number of pairs. Journal of Applied
Psychology, 1952, 36, 123-127.


Paired Comparison Items, Instrument Length  3a, 5, 13


* McCormick, E. J., & Roberts, W. K. Paired comparison ratings: 2. The
reliability of ratings based on partial pairings. Journal of Applied
Psychology, 1952, 36, 188-192.

Paired Comparison Items, Reliability, Instrument Length  3a, 5, 1

Psych.  Abst.  , 27 , #3062  ^ ^ 


McCormick, L. C. A rationale for scaling unordered attributes. American
Journal of Sociology, 1948, 54, 31-35.


McDonagh,  E.  C.,  & Rosenblum,  L.  A.  A comparison  of  mailed  questionnaires 
and subsequent structured interviews. Public Opinion Quarterly, 1965, 29,
131-136. 


McElwain,  D.  W.  A note  on  item  analysis.  Guidance  Review,  1964,  ^(2), 
56-65. 


* McGarvey,  H.  R.  Anchoring  effects  in  the  absolute  judgment  of  verbal 
materials.  Archives  de  Psychologie,  Geneve,  1943,  No.  281. 

Questionnaire  Theory  and  Development,  Rating  Scales  3f,  14 


McGee,  R.  K.  The  relationship  between  response  style  and  personality 
variables  - I.  The  measurement  of  response  acquiescence.  Journal  of  Ab- 
normal and  Social  Psychology,  1962,  ^(3),  229-233. 


McGinnies, E. Cross-cultural investigation of some factors in persuasion
and attitude change (AD 727 625). College Park, Maryland: Maryland Univ.,
Dept. of Psychology, 1971. Contract No. NONR-595(21).

Questionnaire Theory and Development

DDC, #AD 727 625  A-N




McNemar,  Q.  Opinion-attitude  methodology.  Psychological  Bulletin,  1946, 
289-374. 


Attitude Measures, Bibliography, Literature Review, Reliability, Validity  16

ORA  R-H


McReynolds,  P.  (Ed.)  Advances  in  psychological  assessment,  Vol.  I.  Palo 
Alto, Calif.: Science and Behavior Books, 1968.


Medland, F. F. Predictor indices for officer personnel actions - officer
indices (DAOZ9849). Arlington, Va.: OCRD Behavioral and Systems Research
Laboratory,  1973. 

N/A  18 
DDC  (Rev.)  A-NA 


Medley,  D.  M.  The  influence  of  item  modality  on  the  dimension  measured 
by  a test.  Journal  of  Experimental  Education,  1956,  303-307. 


Achievement Measures  18


Psych.  Abst. , 3]^,  #6929 


A-N 


Meehl,  P.  E.,  & Hathaway,  S.  R.  The  K factor  as  a suppressor  variable  in 
the MMPI. Journal of Applied Psychology, 1946, 525-564.


* Mehling,  R.  A simple  test  for  measuring  intensity  of  attitudes.  Public 
Opinion  Quarterly,  1959,  ^(4),  576-578. 

Semantic  Differential  Items,  Attitude  Measurements,  8,  2 

Rating  Scales 

Psych.  Abst.  , 37_,  #1070  A-M 


Meisels, M., & Ford, L. H., Jr. Social desirability response set and
semantic  differential  evaluative  judgments.  Journal  of  Social  Psychology, 
1969,  78(1),  45-54. 




Meister, D., & Rabideau, G. Human factors evaluation in system
development. New York: John Wiley, 1965.


Mendelsohn, M., & Linden, J. Development of an atypical response scale
(ERIC  Document  Reproduction  Service,  ED  059  244).  Paper  presented  at  the 
Annual  Meeting  of  the  Midwestern  Psychological  Association,  Detroit, 
Michigan,  1971. 

Questionnaire  Theory  and  Development  18 

ERIC  Document  Reproduction  Service,  ED  059  244  A-N 


Menefee,  S.  C.  The  effect  of  stereotyped  words  on  political  judgments. 
American Sociological Review, 1936, 1(1), 614-621.


Menne, J. W., & Tolsma, R. J. A discrimination index for items in
instruments using group responses. Journal of Educational Measurement, 1971,
8(1), 5-7.


Merenda,  P.  F.,  & Clarke,  W.  V.  Forced-choice  vs.  free-response  in 
personality  assessment.  Psychological  Reports,  1963,  18 (1)  , 1j9-169. 

Forced-Choice Items, Personality Measures, Check-List, Respondent's Motivation  2, 11

Psych. Abst., 38, #6089


Merrens,  M.  Generality  and  stability  of  extreme  response  styles. 
Psychological Reports, 1970, 27, 802.


Merton, R. K. Fact and factitiousness in ethnic opinionnaires. American
Sociological  Review,  1940,  13-28. 


Merton, R. K., Fiske, M., & Kendall, P. L. The focused interview.
Glencoe, Ill.: The Free Press, 1956.




Messick,  S.  Separate  set  and  content  scores  for  personality  and  attitude 
scales.  Research  Bulletin  61-16.  Princeton,  New  Jersey:  Educational 
Testing  Service,  1961. 


Messick, S. Response style and content measures from personality
inventories. Educational and Psychological Measurement, 1962, 22, 41-56.


Messick, S. J. Response sets. Research Memorandum 64-8. Princeton, New
Jersey:  Educational  Testing  Service,  1964. 


N/A  N/A

N/A  T-M

Messick,  S.,  & Fritzky,  F.  J.  Dimensions  of  analytic  attitude  in  cognition 
and  personality.  Journal  of  Personality.  1963,  346-370. 


Messick,  S.,  & Kogan,  N.  Differentiation  and  compartmentalization  in 
object-sorting measures of categorizing style. Perceptual and Motor Skills,
1963,  16,  47-51. 


Messick,  S.,  & Ross,  J.  (Eds.)  Measurement  in  personality  and  cognition. 
New York: John Wiley, 1962.


Metfessel, N. S., & Sax, G. Systematic biases in the keying of correct
responses on certain standardized tests. Educational and Psychological
Measurement, 1958, 18, 787-790.

Investigator Error, True-False Items, Response Alternatives  13, 3c

Psych. Abst., 34, #152  A-M


Metzner,  C.  A.  An  application  of  scaling  to  questionnaire  construction. 
Journal  of  the  American  Statistical  Association,  1950,  ^ (249),  112-118. 

Clarity, Paired Comparison Items, Question Stem, Scaling  4, 14, 3g

Bureau of Census, #7111009601 (Rev.)  R-M


Metzner, H., & Mann, F. A limited comparison of two methods of data
collection: The fixed alternative questionnaire and the open-ended interview.
American Sociological Review, 1952, 17(4), 486-491.

Interviews, Attitude Measures, Respondent's Motivation, Anonymous Respondent  1, 9, 11

ORA  R-M


Metzner, H., & Mann, F. Effects of grouping related questions in
questionnaires. Public Opinion Quarterly, 1953, 17, 136-141.

Instrument  Format  3c 

Psych. Abst., 28, #2467 (Rev. from rept.)  R-H


Meyer,  G.  The  choice  of  questions  on  essay  examinations.  Journal  of 
Educational  Psychology,  1939,  161-171. 


Michael, J. J. The reliability of a multiple-choice examination under
various  test-taking  instruction.  Journal  of  Educational  Measurement,  1968, 
5,  307-314. 


Micklin, M., & Durbin, M. Syntactic dimensions of attitude scaling
techniques: Sources of variation and bias. Sociometry, 1969, 32(2), 194-206.


Micko, H. C., & Fischer, W. The metric of multidimensional psychological
spaces as a function of the differential attention to subjective attributes.
Psychometrika, 1970, 35, 199-227.


Miklich, D. R. Item characteristics and agreement-disagreement response
set.  Dissertation  Abstracts,  1966,  ^(10),  6210. 

Response Bias, Clarity, Question Stem  12, 4, 3g
Dissertation  Abstracts,  ^(10)  , #6210 


R-H 


Milhalland, J. E. The scale values of statements of attitude toward the
law after a 25-year interval. Journal of Abnormal and Social Psychology,
1958, 131-132.


Milhalland,  J.  E.  Theory  and  techniques  of  assessment.  Annual  Review  of 
Psychology, 1965, 15, 311-346.


Miller, D. C. Handbook of research design and social measurement. (2nd
ed.)  New  York:  David  McKay,  1970. 


Miller, G. G. Development of improved course evaluation methodology for
technical training (DF220180). Lowry AFB, Colorado: Air Force Human
Resources Laboratories, June 1973.

Attitude  Measures,  Military  Personnel  14,  9 
DDC  (Rev.)  A-NA 


Miller,  I.,  & Minor,  F.  J.  Influence  of  multiple-choice  answer  form  design 
on answer-marking performance. Journal of Applied Psychology, 1963, 47(6),
374-379. 

Multiple  Choice  Items,  Instrument  Format  2,  3g,  1 

Psych. Abst., 38, #6646  A-M


Miller, N., Doob, A. N., Butler, D. G., & Marlowe, D. The tendency to
agree: Situational determinants and social desirability. Journal of
Experimental Research in Personality, 1965, 1, 78-83.


Miller,  S.,  et  al.  Use  of  the  semantic  differential  in  the  study  of  moti- 
vation. Psychological  Reports,  1971,  ^(3,,  1279-1282. 


Mills,  J.  D.  Do  effective  research  with  a good  questionnaire.  Industrial 
Marketing,  1972,  2Z’  54. 




Milne,  D.  W.  A comparison  of  scaling  procedures  and  their  evaluation  as 
estimates  of  external  criteria.  Dissertation  Abstracts,  1966,  ^(9),  5542. 


Mindak,  W.  A.  A new  technique  for  measuring  advertising  effectiveness. 
Journal  of  Marketing.  1956,  W,  367-378. 


Semantic Differential Items  17

ORA  R-N


Mindak, W. A. Fitting the semantic differential to the marketing problem.
Journal of Marketing, 1961, 25, 28-33.

Semantic  Differential  Items  15 

ORA  R-N 


* Miron,  M.  S.  The  influence  of  instruction  modification  upon  test-retest 
reliabilities  of  the  semantic  differential.  Educational  and  Psychological 
Measurement, 1961, 21, 883-893.

Reliability, Semantic Differential Items, Investigator Error  7, 11

Educational and Psychological Measurement, 21, p. 892  R-H


* Mittelsteadt, R. A. Semantic properties of selected evaluative adjectives:
Other  evidence.  Journal  of  Marketing  Research,  1971,  8,  236-237. 


Adjectives  6

ORA  R-H


Modern  Army  Selected  Systems  Test,  Evaluation,  and  Review  (MASSTER) . MASSTER 
Test  Officer's  Planning  Manual.  Fort  Hood,  Texas:  Headquarters  MASSTER, 
June,  1974. 




* Mogar, R. E. Three versions of the F Scale and performance on the
Semantic Differential. Journal of Abnormal and Social Psychology, 1960,
^ 262-265. 

Personality Measures, Response Bias, Semantic Differential Items  10

Psych. Abst., 34, #7648  A-N


Mollenkopf, W. B. Time limits and behavior of test takers. Educational
and Psychological Measurement, 1960, 20, 223-230.

Instrument Length, Investigator Error, Achievement Measures  7

Psych. Abst., 35, #7089 (Rev. from rept.)


Moore,  J.  C.,  & Rubin,  Z.  Assessment  of  subjects'  suspicions.  Journal  of 
Personality  and  Social  Psychology,  1971,  ^(2),  163-170. 


Moore,  M.  Moderator  effects  of  ambivalence  in  attitude  measurement.  Paper 
presented  to  the  Midwestern  Psychological  Association,  Cleveland,  1972. 


Moore,  M.  Ambivalence  in  attitude  measurement.  Educational  and  Psycho- 
logical Measurement,  1973,  33(2),  481-483. 


Mordkoff,  A.  M.  Functional  vs  nominal  antonymy  in  semantic  differential 
scales.  Psychological  Reports.  1965,  J|^(3,  Part  1),  691-692. 


Morgan,  R.  Interviewer  introspection  in  'bias'.  Public  Opinion  Quarterly. 
1947,  11(4),  615-616. 


Morton, M. A., Hoyt, W. G., & Burke, L. K. A new type of test answer
sheet. American Psychologist, 1955, 10, 572.




Moscovici,  S.  Attitudes  and  opinions.  Annual  Review  of  Psychology.  1963, 
14,  231-260. 


Mosel, J. N., & Cozan, L. W. The accuracy of application blank work histor-
ies. Journal of Applied Psychology, 1952, 36, 365-369.


Mosel, J. N., & Goheen, H. W. The employment recommendation questionnaire:
III. Validity of different types of references. Personnel Psychology, 1959,
12, 469-477.


Moser, C. A. Interview bias. Review of the International Statistical
Institute, 1951, 19(1), 28-40.


Moser,  C.  A.  Survey  methods  in  social  investigation.  London:  Heinemann, 
1958. 


Bibliography  16

Psych. Abst., 33, #10.138  A-H


* Mosier, C. I. A modification of the method of successive intervals.
Psychometrika, 1940, 5, 101-107.

Adjectives,  Scaling,  Reliability  6,  14,  8,  2 

ORA  R-H 


* Mosier, C. I. A psychometric study of meaning. Journal of Social Psychol-
ogy, 1941 (a), 123-140.

Adjectives  6 

ORA  R-H 


* Mosier, C. I. Tables from a quantitative study of meaning. (Privately
issued.) 1941 (b).

Adjectives  6 

N/A  T-H 




Mosier, C. I., & McQuitty, J. V. Methods of item validation and abacs for
item-test correlation and critical ratio of upper-lower difference.
Psychometrika, 1940, 57-65.


* Mosier, C. I., & Price, H. G. The arrangement of choices in M-C questions
and a scheme for randomizing choices. Educational and Psychological Measure-
ment, 1945, 5, 379-382.

Multiple Choice Items, Response Alternatives  3b, 12

ORA  R-M


Moskowitz, H. R. Studies of validating evaluation panels: Test environ-
ments data, development of new measurement techniques for food acceptance
(DA0D4177). Natick, Mass.: Army Natick Laboratories, 1973.

N/A  N/A

N/A  R-NA


Mosteller, F., Bush, R. R., & Green, B. F. Selected quantitative tech-
niques and attitude measurement. Reading, Mass.: Addison-Wesley, 1970.


Mount,  E.  Communications  barriers  and  the  reference  question-problem  of 
inquiries  not  asking  the  reference  questions  they  actually  should  pose  to 
obtain  information  they  need.  Special  Librarian,  1966,  575-578. 

Mudd, S. A. The treatment of handling-qualities rating data. Human
Factors, 1969, 11(4), 321-330.


Mueller,  D.  J.  A technique  for  the  utilization  of  items  with  highly  skewed 
response  distributions  in  personality  scaling.  Journal  of  Experimental 
Education,  1972,  ^ 4),  62-64. 


Mullins, C. J., Massey, I. H., & Riederich, L. D. Reasons for air force
enlistment. Technical Report No. 68-101, USAF AFHRL, 1968.




* Murphy, E. F., Bailey, R. M., & Covell, M. R. Observations on methods to
determine food palatability and comparative freezing quality of certain new
strawberry varieties. Food Technology, 1954, 8, 113-116.

Ranking, Rating Scales  2

Psych. Abst., 29, #3266 (Rev.)  A-H


Murphy,  G.,  & Likert,  R.  Public  opinion  and  the  individual.  New  York: 
Harper,  1938. 


Murphy,  G.,  Murphy,  L.  B.,  & Newcomb,  T.  M.  Experimental  social  psychology. 
New  York:  Harper,  1937. 


Murphy,  R.  W.  How  and  where  to  look  it  up.  New  York:  McGraw-Hill,  1959. 


* Muscio, B. The influence of the form of a question. British Journal of
Psychology, 1916, 8, 351-389.

Instrument  Format  3g 

ORA  R-H 


Myers,  A.  E.  Judgments  of  variability  in  two  response  modes.  Research 
Bulletin  64-40.  Princeton,  New  Jersey:  Educational  Testing  Service,  1964. 

N/A  N/A 

N/A  T-M 


Myers, C. T. A note on the standard length of a test. Psychometrika,
1961, 26, 443-446.


* Myers, C. T. The relationship between item difficulty and test validity
and reliability. Educational and Psychological Measurement, 1962, 22(3),
565-571.

Validity, Reliability, Question Stem  8

Psych. Abst., 22.», #4936  A-N


Myers,  J.  H.  Finding  determinant  attributes.  Journal  of  Advertising 
Research , 1^(6),  9-12. 


Data Analysis  8

ORA  R-N


Myers, J. H., & Alpert, M. I. Determinant buying attitudes: Meaning and
measurement. Journal of Marketing, 1968, 32, 13-20.


Attitude Measures, Questionnaire Theory and Develop-  15
ment

ORA  R-N


* Myers, J. H., & Haug, A. F. How a preliminary letter affects mail survey
returns and costs. Journal of Advertising Research, 1967, 7(3), 37-40.


Respondent's Motivation  11

ORA  R-M


* Myers, J. H., & Warner, W. G. Semantic properties of selected evaluation
adjectives. Journal of Marketing Research, 1968, 5, 409-412.


Adjectives  6

ORA  R-H


Naidoo, J. C. An inquiry into the structure of attitudes and behavior: A
validation study (AD 647 210). Urbana, Illinois: University of Illinois,
Group Effectiveness Research, 1966. Report No. TR-38.


Questionnaire Theory and Development  18

DDC, #647210  A-N


Nakamura, C. Y. Salience of norms and order of questionnaire items: Their
effect on responses to the items. Journal of Abnormal and Social Psychology,
1959, 139-142.


Instrument Format, Investigator Error  3c, 12

Psych. Abst., 3^, #4199  A-M




Namaksy, J., Jr., Bonertz, D., Lyon, L. B., & Schuster, E. P. Cost/effect-
iveness of Air Force all-volunteer force programs (AD-920 615L). Maxwell
AFB, Ala.: Air University, 1974. Report No. AU-1930-74.

N/A  N/A

N/A  T-M

Namias,  J.  Measuring  variation  in  interviewer  performance.  Journal  of 
Advertising  Research,  1966,  ^(1),  8-12. 

Interviews,  Investigator  Error  13 

Bureau  of  Census,  #7111012301  A-N 


Nash,  A.  N.  Modification  of  forced-choice  format  for  use  in  personal 
selection  and  appraisal.  Psychological  Reports,  1971,  ^(1),  108-110. 


Forced Choice Items, Response Alternatives  14, 3a

ORA  R-H


Nathan, E. D. Art of asking questions. Personnel, 1966 (a), 63-71.


Nathan,  E.  D.  Asking  questions  that  get  results.  Supervisory  Management, 
1966  (b),  U,  4-8. 

Attitude  Measures  3g 

ORA  R-N 


National Education Association, Research Division. The questionnaire.
Washington, D. C.: National Education Association, Research Division, NEA
Research Bulletin, 1930, 8(1), 1-15.


Naval  School  of  Aviation  Medicine.  The  deliberate  use  of  a set  to  "fake" 
in  personality  questionnaires.  Pensacola,  Florida:  Naval  School  of  Aviation 
Medicine.  Report  No.  29,  Project  NM  14  12  11  Subtask  1. 




Needham, J. G. The time-error in comparison judgments. Psychological
Bulletin, 1934, 31, 229-243.


Neeley, T. E. A study of error in the interview. New York: Columbia University,
1937. (Unpublished doctoral dissertation), (S N29 1937).

N/A  N/A

N/A  T-H


Neidt, C. O., & Merrill, W. R. Relative effectiveness of two types of
response to items of a scale on attitudes toward education. Journal of
Educational Psychology, 1951, 432-436.

Attitude  Measures,  Rating  Scales,  Paired  Comparison  2,  3a 

Items,  Validity,  Reliability,  Response  Alternatives 

Psych.  Abst. , 2^,  #5102  A-H 


Nelson, M. J., Denny, E. C., & Coladarci, A. P. Statistics for teachers.
New York: The Dryden Press, 1956.


Newhall, S. M. Comparability of the method of single stimuli and the
method of paired comparisons. American Journal of Psychology, 1954, 67,
96-103.

Paired  Comparison  Items,  Close-Ended  Items  2 

Psych.  Abst.  , 2^,  #63  A-H 


Newman,  S.  Differences  between  early  and  late  respondents  to  a mailed 
survey.  Journal  of  Advertising  Research,  1962,  2(2),  37-39. 


Newman, S. H. Quantitative analysis of verbal evaluations. Journal of
Applied  Psychology,  1954,  38,  293-296. 



Newman, S. H., Bobbitt, J. M., & Cameron, D. C. The reliability of the
interview method in an officer candidate evaluation program. American
Psychologist, 1946, 1(4), 103-109.


Nias, D. K., Wilson, D. G., & Woodbridge, J. M. Test-retest results on
the conservatism scale completed under conditions of anonymity and identi-
fication. British Journal of Social and Clinical Psychology, 1971, 10(3),
282-283.


Nicholls,  J.  Some  effects  of  testing  procedure  on  divergent  thinking. 
Child  Development,  1971,  ^(5),  1647-1651. 


Nisselson, H., & Woolsey, T. D. Some problems of the household interview
design for the national health survey. Journal of the American Statistical
Association, 1959, 69-86.


Nixon, J. E. The mechanics of questionnaire construction. Journal of
Educational Research, 1954, 47(7), 481-487.

Noelle-Neumann, E. Wanted: Rules for wording structured questionnaires.
Public Opinion Quarterly, 1970, 191-201.


Nordlie,  P.  G.  Further  studies  of  attitude  measurement  by  a word  associa- 
tion technique  (AD  3 308).  Baltimore,  Maryland:  University  of  Maryland, 
Dental  School,  1952.  Report  No.  TR-17. 


Norman,  R.  P.  Extreme  response  tendency  as  a function  of  emotional  adjust- 
ment and  stimulus  ambiguity.  Journal  of  Consulting  and  Clinical  Psychology. 
1969,  33,  406-410. 


Norman, W. T. Personality measurement, faking, and detection: An assess-
ment method for use in personnel selection. Journal of Applied Psychology,
1963 (a), 225-241.




* Norman, W. T. Relative importance of test item content. Journal of Con-
sulting Psychology, 1963 (b), 27(2), 166-174.

Validity,  Personality  Measures,  Question  Stem  8,  4 

Psych. Abst., 32., #7980  A-M


* North,  W.  E.,  & Schmid,  J.  A comparison  of  three  ways  of  phrasing  Likert 
type  attitude  items.  Journal  of  Experimental  Education,  1960,  29,  95-100. 

Attitude  Measures,  Clarity,  Investigator  Error, 

Military  Personnel,  Reliability  3g 

ORA  R-H 


Norton,  J.  K.  The  Questionnaire.  NEA  Research  Bulletin  8,  No.  1,  1930. 


Nosanchuk,  T.  A.,  & Marchak,  M.  P.  Pretest  sensitization  and  attitude 
change.  Public  Opinion  Quarterly,  1969,  32(1),  107-111. 


Nowakowska,  M.  A model  of  questionnaire  responding.  Przeglad  Psychologi- 
czny , 1970,  22.,  95-114. 


Nowakowski,  M.  A model  for  answering  a questionnaire  item.  Psychological 
Bulletin,  1971,  2 (1),  37-45. 


* Nuckols, R. C. A note on pre-testing public opinion questions. Journal of
Applied Psychology, 1953, 37, 119-120.

Clarity

Psych. Abst., 28, #730  A-M


Nunnally,  J.  Psychometric  Theory.  New  York:  McGraw-Hill,  1967. 




* Nunnally, J., & Husek, T. R. The phony language examination: An approach to
the measurement of response bias. Educational and Psychological Measurement,
1958, 18(2), 275-282.

Response  Bias  12,  9 

Bureau  of  Census,  #7131802001  A-H 


Obe, E. O. Probabilistic, elimination, weighted-choice and conventional
techniques of multiple-choice testing. (Unpublished doctoral dissertation,
University of Pittsburgh), Pittsburgh, Pa.: 1971.


Oda, K. A study of the construction of rating scales: I. Bulletin of
the Faculty of Education, Nagoya University, 1967, 155.


* O'Dell, R. Personal interviews or mail panels? Journal of Marketing,
1962, 26(4), 34-39.

Interviews,  Open-Ended  Items,  Clarity,  Instrument  1,  3c,  4,  12 

Format,  Respondent's  Motivation 

ORA  R-H 


* Odesky, S. F. Handling the neutral vote in paired comparison product
testing. Journal of Marketing Research, 1967, 4, 199-201.

Data  Analysis,  Paired  Comparison  Items,  Preference  8,  3a 

Measures,  Response  Alternatives,  Scoring 

Journal  of  Marketing  Research,  4,  p.  201  R-H 


* Ognibene,  P.  Traits  affecting  questionnaire  response.  Journal  of  Adver- 
tising Research,  1973,  ^(4),  29-34. 

Investigator  Error  9 

ORA  R-H 




O'Gorman, J. G. Rating in performance appraisal: A bibliography.
Australian Army Psychological Research Unit Research Report, No. 4, 1972.

N/A  N/A

N/A  T-H


Ogus, J. L. Forms design - 1958 and 1963 results. Washington, D. C.:
Bureau of the Census, (64-111 (MRD)), 1964.


O'Leary, K. D. The effects of observer bias in field-experimental settings.
Final  report.  (ERIC  Document  Reproduction  Service,  ED  078086).  Stony 
Brook,  N.Y.:  State  University  of  New  York,  Dept,  of  Psychology,  1973. 

Investigator  Error  2^3 

ERIC  Document  Reproduction  Service,  ED  078086  A-H 


Olson, W. C. The waiver of signature in personal reports. Journal of
Applied Psychology, 1936, 20(4), 443-450.

Anonymous  Respondent  32^ 

Potter, Sharpe, Hendee, and Clark, 1972  A-M


O'Neill, J. J. A comparative study of intelligibility values: Forms A
and B. U. S. Naval School of Aviation Project Report No. NM 001 104 500.47,

Achievement Measures  2g

Psych. Abst., 30, #8252  A-M


Ong, J. The opposite-form procedure in inventory construction and research.
NYC: Vantage Press, 1965.


Oosterhof, A. C., & Glasnapp, D. R. Comparative reliabilities of the
multiple choice and true-false formats (ERIC Document Reproduction Service,
ED 064 361). Paper presented at the Annual Meeting of the American Education-
al Research Association, Chicago, 1972.

1 p 

Achievement  Measures 

ERIC  Document  Reproduction  Service,  ED  064  361  A-N 


Operational Test and Evaluation Command. Operational test and evaluation
methodology guide. Draft. Fort Belvoir, Va.: 1973.

Data  Analysis,  Questionnaire  Theory  and  Development  17 


Operations Research Associates. Field Equipment Test Methodology Study.
Mid-Term Contract Report. Appendix H. Questionnaire Construction and
Development Techniques. Contract No. DAAD-05-73-Q-0551. Palo Alto, Ca.:
ORA, 1973.

Textbook  17

ORA  R-H


Operations Research Associates. Operational Test Methodology Guide.
Volume II. Techniques and Guidelines. Annex E. Questionnaire Development
and Analysis Technique. Contract No. DAAG39-73-0107. Palo Alto, Ca.:
ORA, 1974.

Textbook 


Oppenheim, A. N. Questionnaire design and attitude measurement. New York:
Basic  Books,  1966. 

Textbook 

Potter, Sharpe, Hendee and Clark, 1972 (Rev.)  A-H




Orne, M. T. On the social psychology of the psychological experiment:
With particular reference to demand characteristics and their implications.
American Psychologist, 1962, 776-783.


Orpen, C. The wheel and the table: The relative merits of two alternative
instruments for collecting semantic-type data. British Journal of Educa-
tional Psychology, 1972, 42(1), 86-87.

Instrument  Format  3c 

ERIC  Document  Reproduction  Service,  EJ  054524  A-M 


Osburn, B. H., Lubin, A., Loeffler, J. C., & Tye, V. M. The relative
validity of forced choice and single stimulus self-description items.
Educational and Psychological Measurement, 1954, 14, 407-417.

Forced  Choice  Items,  Instrument  Format,  Validity  2 

Psych. Abst., 29, #1813 (Rev. from rept.)  R-H


Osburn, H. G. The effect of item stratification on errors of measurement.
Educational and Psychological Measurement, 1969, 29(2), 295-301.


Osgood, C. E. The nature and measurement of meaning. Psychological
Bulletin, 1952, 49, 197-237.

Questionnaire Theory and Development, Semantic  2, 14

Differential  Items 

Psychological  Bulletin,  49 , p.  231  R-M 


Osgood,  C.  E.  Method  and  theory  in  experimental  psychology.  New  York: 
Oxford  University  Press,  1953. 


Osgood, C. E. On the whys and wherefores of E, P, and A. Journal of
Personality and Social Psychology, 1969, 12(3), 194-199.




Osgood, C. E., & Suci, G. J. Factor analysis of meaning. Journal of
Experimental Psychology, 1955, 325-338.

Semantic  Differential  Items,  Forced  Choice  Items, 

Questionnaire  Theory  and  Development 

Journal  of  Experimental  Psychology,  p.  338  R-M 


* Osgood,  C.  E.,  Suci,  G.  J.,  & Tannenbaum,  P.  H.  The  measurement  of  meaning. 
Urbana, Ill.: University of Illinois Press, 1957.

Semantic  Differential  2 

ORA 


Ostrom, T. M. Item construction in attitude measurement. Public Opinion
Quarterly, 1971-1972, 35, 593-600.


Owens,  W.  A.  The  form  of  items  and  the  distribution  of  false  positive 
scores  on  a neurotic  inventory.  Proceedings  of  the  Iowa  Academy  of  Science, 
1946,  285-288. 


Owens,  W.  A.  Item  form  and  'false-positive'  responses  on  a neurotic  in- 
ventory. Journal  of  Clinical  Psychology,  1947,  3,  264-269. 


Pace, C. R. A situations test to measure social-political-economic
attitudes. Journal of Social Psychology, 1939, 10, 331-344.


Pack, E. C. The effects of testing upon attitude towards the method and
content of instruction. Journal of Educational Measurement, 1972, 9(2),
141-144.


Palmer, G. L. Factors in the variability of response in enumerative studies.
Journal of the American Statistical Association, 1943, 38(222), 143-270.




* Paradise, L. M., & Blankenship, A. B. Depth questioning. Journal of
Marketing, 1951, 15, 274-288.

Interviews, Literature Review, Questionnaire Theory  1, 17
and Development

ORA  R-M

Parducci, A. Direction of shift in the judgment of a single mind.
Journal of Experimental Psychology, 1956, 51, 169-178.

Psychophysical Measures  18

Psych. Abst., 31, #2378  A-N


Parducci, A. Range-frequency compromise in judgment. Psychological
Monographs, 1962, 77(2, No. 565).


Parducci, A., & Perrett, L. F. Category rating scales: effects of relative
spacing and frequency of stimulus values. Journal of Experimental Psychol-
ogy, 1971, 427-452.

N/A  N/A

N/A  T-H


Parker, C. A., Wright, E. W., & Clark, S. G. Questions concerning the
interview as a research technique. Journal of Educational Research, 1957,
215-222.

Reliability, Interviews, Close-Ended Items  1

Psych. Abst., 33, #3802  A-M


Parker, G. V., & Veldman, D. J. Item factor structure of the Adjective
Check List. Educational and Psychological Measurement, 1969, 23, 605-613.

Adjectives, Check List  6

ORA  R-M


Parris, H. L. A comparative study of forced-choice and check-list ratings
of Air Force R.O.T.C. instructors. (Doctoral dissertation, Ohio State
University) Columbus, Ohio: Graduate Division, 1951.

N/A  N/A

N/A  T-H



Parry, H. J., & Crossley, H. M. Validity of responses to survey questions.
Public Opinion Quarterly, 1950, 14, 61-80.


Parsons,  H.  Man-machine  system  experiments.  Baltimore:  The  Johns  Hopkins 
Press,  1972. 


Parten,  M.  B.  Surveys,  polls,  and  samples:  Practical  procedures.  New 
York:  Harper,  1950. 


Pasanella, A. K., et al. Bibliography of test criticism (ERIC Document
Reproduction  Service,  ED  039  395).  New  York:  College  Entrance  Examination 
Board,  1967. 

Achievement  Measures  18 

ERIC  Document  Reproduction  Service,  ED  039  395  A-N 


Patterson,  A.  C.  The  questionnaire  as  a means  of  educational  research. 
III.  The  construction  and  administration  of  a questionnaire.  Scottish 
Educational  Journal,  1942,  2^,  708. 


Paul, L. E. The construction of interval scales for measuring the accept-
ability of clothing and equipment in field tests. Quartermaster R&E Field
Evaluation Agency, 1960. Technical Report R-4, Project No. 07-98-05-001.


* Pauli,  D.  Reliability  of  ordinal  scales  derived  by  ego-involved  judges. 
Journal  of  Social  Psychology,  1968,  143-144. 

Response  Bias,  Paired  Comparison  Items,  Ranking  12,  2,  10 

Psych.  Abst. , 44 , #1758  A-H 


Payne, S. L. Thoughts about meaningless questions. Public Opinion Quarter-
ly, 1950 (b), 14, 687-696.




* Payne, S. L. Case study in question complexity. Public Opinion Quarterly,
1950 (a), 13, 653-658.

Clarity, Interviews, Multiple Choice Items,  3c, 4
Investigator Error

Potter, Sharpe, Hendee, & Clark, 1972 (rev.)  R-H


* Payne, S. L. The art of asking questions. Princeton, N. J.: Princeton
University Press, 1951 (Revised Ed., 1963).

Textbook  17

Potter, Sharpe, Hendee and Clark, 1972


* Payne, S. L. Combination of survey methods. Journal of Marketing Research,
1964, 1, 61.

Interviews, Close-Ended Items  1, 17

ORA  R-N


* Payne,  S.  L.  Are  open-ended  questions  worth  the  effort?  Journal  of 
Marketing  Research,  1965,  2,  417-418. 

Close-Ended  Items,  Open-Ended  Items,  Interviews  2 


* Peabody, D. Two components in bipolar scales: Direction and extremeness.
Psychological Review, 1962, 69, 65-73.

Scaling, Scoring  8

Psychological  Review,  p.  65-73.  ^ 


Peabody, D. Attitude content and agreement set in scales of authoritar-
ianism, dogmatism, anti-semitism and economic conservatism. Journal of
Abnormal and Social Psychology, 1963, 1-11.




Peabody, D. Models for estimating content and set components in attitude
and personality scales. Educational and Psychological Measurement, 1964,
24, 255-69.


Peabody,  D.  Authoritarianism  scales  and  response  bias.  Psychological 
Bulletin,  1966,  11-23. 


* Pearlin, L. I. The appeals of anonymity in questionnaire response. Public
Opinion Quarterly, 1961, 25(4), 640-647.

Respondent's  Motivation,  Anonymous  Respondent  11 

Bureau  of  the  Census,  #7111033901  A-H 


Pedersen, D. M., & Breglio, V. J. The correlation of two self-disclosure
inventories with actual self-disclosure: A validity study. Journal of
Psychology, 1968, 68, 291-298.


Peek, R. M. A comparison of four scaling techniques in the development
of a behavior rating scale. Dissertation Abstracts, 1968, 29(3-B), 1177-1178.

Rating Scales, Reliability  2

Dissertation Abstracts, 29(3-B), pp. 1177-1178 (Rev.)  A-M


Pelz,  D.  C.  The  influence  of  anonymity  on  expressed  attitudes.  Human 
Organization,  1959,  (2),  88-91. 


* Penner, L., Homant, R., & Rokeach, M. Comparison of rank-order and
paired-comparison  methods  for  measuring  value  systems.  Perceptual  and 
Motor  Skills,  1968,  27(2),  417-418. 

Paired  Comparison  Items,  Reliabilities,  Ranking  2 

ORA  R-H 


Peouts,  J.  H.,  & Rader,  G.  E.  The  influence  of  interviewer  character- 
istics on  the  initial  interview.  Social  Casework.  1962,  ^(10),  548-552. 




Perloff, R. Consumer analysis. Annual Review of Psychology, 1968, 19,
437-466.


Perrine, M. W. Some influences of verbal reinforcement upon reference
scale formation and discrimination. Dissertation Abstracts, 1959, 19,
2172-2173.

Psychophysical  Measures,  Investigator  Error, 

Respondent's  Motivation 

Dissertation Abstracts, 19, pp. 2172-2173  A-M


Perry,  D.  K.  Forced-choice  vs.  L-I-D  response  items  in  vocational 
interest measurement. Journal of Applied Psychology, 1955, 256-262.

Forced  Choice  Items,  Multiple  Choice  Items  2 

Psych.  Abst. , 30 , #4734  A-H 


Personnel Research Section, PRPB, Adjutant General's Office. The forced-
choice technique and rating scales. American Psychologist, 1946, 1, 267.

Rating Scales, Forced Choice Items  2

American Psychologist, 1, p. 267  A-M


Peryam, D. R., Polemis, B. W., Kamen, J. M., Eindhoven, J., & Pilgrim, F. J.
Food Preferences of Men in the U. S. Armed Forces. Washington, D.C.:
Department of the Army, Quartermaster Food and Container Institute, 1960.

N/A  N/A

N/A  T-M


Peters, D. L., & McCormick, E. J. Comparative reliability of numerically
anchored versus job-task anchored rating scales. Journal of Applied Psy-
chology, 1966, 50, 92-96.

Reliability,  Response  Alternatives,  Rating  Scales  3f 

Psych. Abst., #4642  A-H




Peters,  D.  L.,  & Messier,  V.  The  effects  of  question  sequence  upon  objec- 
tive test  performance.  The  Alberta  Journal  of  Educational  Research.  1970, 
16,  253-265. 


Peters, H. N. A multiple choice supraordinality test. Journal of
Clinical Psychology, 1958, 14, 416-418.


Peterson, F. A. A technique for the detection of blind checking in ques-
tionnaire research. Educational and Psychological Measurement, 1961, 21,
361-362.

Response  Bias  12 


Pettigrew,  T.  F.  The  measurement  and  correlates  of  category  width  as  a 
cognitive  variable.  Journal  of  Personality,  1958,  26,  532-544. 


Phifer, M. K. Influence of the process of discrimination on the selection
of statements for an attitude scale. Public Opinion Quarterly, 1971-1972,
35(4), 601-605.

Attitude Measures, Card Sorts, Investigator Error,  3f, 10, 13
Scaling


Phillips, M. Problems of questionnaire investigation. Research Quarterly,
1941, 12, 528-537.


Pickett, G. D. A comparison of translation and blank-filling as testing
techniques. Available from Subscription Department, Oxford University
Press, 1968.


Pierre,  K.  J.  Social  desirability  response  set  in  clinical  interviewing. 
Dissertation  Abstracts  International.  1971,  32,  1856-1857. 




* Pilgrim, F. J., & Wood, K. R. Comparative sensitivity of rating scale
and paired comparison methods for measuring consumer preference. Food
Technology, 1955, 385-387.

Rating Scales, Paired Comparison Items  2

Psych.  Abst.  , 30,  #5408  A-H 


Pintner,  R.,  & Forlano,  G.  The  influence  of  attitude  upon  scaling  of 
attitude  items.  Journal  of  Social  Psychology,  1937,  8,  39-45. 

Pippette, G. L. An experiment with college questionnaires. Journal of
Marketing, 1940, 5, 122-124.

Check  List,  True-False  Items,  Open-Ended  Items  2 

ORA  R-N 


Politz,  A.  Questionnaire  validity  through  the  opinion-forming  question. 
The  Journal  of  Psychology,  1953,  11-15. 


Pollack, I. Iterative techniques for unbiased rating scales. Quarterly
Journal of Experimental Psychology, 1965, 17(2), 139-148.


Pollaczek,  P.  P.  A study  of  malingering  on  the  CVS  abbreviated  individual 
intelligence  scale.  Journal  of  Clinical  Psychology,  1952,  75-81. 


Postman,  L.,  & Crutchfield,  R.  S.  The  interaction  of  need,  set,  and 
stimulus-structure  in  a cognitive  task.  American  Journal  of  Psychology, 
1952,  196-217. 


Postman, L., & Zimmerman, C. Intensity of attitude as a determinant of
decision-time. American Journal of Psychology, 1945, 58, 510-518.




* Potter, D. R., Sharpe, K. M., Hendee, J. C., & Clark, R. N. Question-
naires for research: An annotated bibliography on design, construction and
use. Portland, Oregon: Department of Agriculture, Pacific Northwest Forest
and Range Experiment Station, 1972. Forest Service Paper PNW-140.

Bibliography  16 

Potter,  Sharpe,  Hendee  and  Clark,  1972  (Rev.)  A-H 


* Potter, G. S., & Tinkleman, V. Anchor effects in the development of be-
havior rating scales. Educational and Psychological Measurement, 1970,
30(2), 311-318.

Response Bias, Response Alternatives  3f, 12


Powell,  W.  R.  Reappraising  the  criteria  for  interpreting  informal  inven- 
tories. International  Reading  Association  Conference  Proceedings.  1968, 
13(14),  100-109. 


Prien, E. P., Barrett, G., & Svetlik, B. Uses of questionnaires in job
evaluation.  Journal  of  Industrial  Psychology,  1965,  3(4),  91-94. 


* Prien,  E.  P.,  Otis,  J.  L.  , Campbell,  J.  R.,  & Saleh,  S.  Comparison  of 
methods  of  measurement  of  job  attitudes.  Journal  of  Industrial  Psychology, 
1964,  2(4),  87-97. 

Close-Ended  Items,  Open-Ended  Items  2 

Psych. Abst., 40, #11553  A-M


Proctor,  C.  H.  Variations  in  response  errors  induced  by  changing  instruc- 
tions to  enumerators.  Proceedings  of  the  American  Statistical  Association. 
Social  Statistics  Section,  1965,  51-55. 


Proctor, C. H. Reliability of a Guttman scale score. Proceedings of the
American Statistical Association. Social Statistics Section, 1971, 14,
348-390.




Profession Research Associates. A comparative study of mail questionnaire
techniques. Chicago: Putnam Pub. Co., 1959.


Proshansky,  H.  M.  A projective  method  for  the  study  of  attitudes. 
Journal  of  Abnormal  and  Social  Psychology.  1943,  393-395. 


Prothro, E. T. The effect of strong negative attitudes on the placement
of  items  in  a Thurstone  scale.  Journal  of  Social  Psychology.  1955,  41, 
11-18. 

Attitude  Measures,  Investigator  Error,  Scaling,  10,  12 

Response  Bias 

Journal  of  Social  Psychology,  41,  (Rev.)  R-M 


Prothro,  E.  T.  Personal  involvement  and  item  displacement  on  Thurstone 
scales.  Journal  of  Social  Psychology.  1957,  4^,  191-196. 

Response  Bias  12 

Psych.  Abst . , 33 , #10.179  A-H 


Pugh, R. C. Empirical evidence on the application of Lord's sampling
technique to Likert items. Journal of Experimental Education, 1971, 39(3),
54-56.


Pusar,  A.  An  indirect  approach  to  the  measurement  of  vocational  interests. 
(Master's  Thesis,  Syracuse  University),  Syracuse,  New  York:  1972. 


Pyrczak, F., Jr. Objective evaluation of the quality of multiple-choice
test items. (Doctoral dissertation, University of Philadelphia) Philadelphia,
Pa.: 1972.


Pyrczak, F. Use of similarities between stems and keyed choices in multiple-
choice items. Paper presented at Annual Meeting of National Council for
Measurement in Education, 1973.


Quereshi, M. Y. The development of the Michill Adjective Rating Scale
(MARS). Journal of Clinical Psychology, 1970, 26(2), 192-196.

* Quinn,  J.  L.  Performance  appraisals:  The  relation  between  ratings  and 
selected  characteristics  of  the  rating  dyads.  Dissertation  Abstracts.  1967, 
28B,  2172. 

Response  Bias,  Raters  9,  12 

ORA  A-M 


* Quinn,  R.  P.  Conformity,  personality,  and  the  extraneous  third  variable- 
acquiescence  response  set.  Dissertation  Abstracts.  1963,  ^(6),  2606. 

Response  Bias,  Personality  Measures  12 

Dissertation  Abstracts,  24 , p.  2606  (Rev.)  A-H 


Quinn, S. B., & Belson, W. A. The effects of reversing the order of pre-
sentation of verbal rating scales in survey interviews. London: Reprint
Series, Survey Research Centre, London School of Economics and Political
Science.

Instrument  Format,  Rating  Scales,  Response  Alternatives  3b 

ORA  R-H 


* Rainio, K. The effect of the selection situation on responses to question-
naires. Acta Psychologica, 1956, 12, 244-246.

Investigator  Error  7 

Psych.  Abst.  , #999  A-M 

* Rambo,  W.  W.  Equal-appearing  interval  scales,  own-attitude,  and  experimental 
instructions:  A note.  Perceptual  and  Motor  Skills.  1968,  ^(3,  Part  I) 
839-842. 

Attitude  Measures,  Scaling,  Investigator  Error  7,  10 

ORA  R-H 




Ramsey, J. O. The effect of number of categories in rating scales on
precision of estimation of scale values. Psychometrika, 1973, 38(4),
513-532.

Response  Alternatives 

Psychometrika, 38, p. 513  R-NA


Ranta, T. J. Social attitudes and response style. Educational and Psycho-
logical Measurement, 1961, 21(3), 543-557.


Rapaport, G. M., & Berg, I. A. Response sets in a multiple-choice test.
Educational and Psychological Measurement, 1955, 15, 58-62.

Response Bias  12

Psych. Abst., 30, #1630


Rappeport,  M.  A.  Comments  on  'an  experimental  study  of  payments  to 
respondents.' Public Opinion Quarterly, 1971, 35(3), 335.


* Rappard, C. A. Enkele aspecten van de juistheid der personeelsbeoordelingen.
(Some aspects of the correctness of personnel ratings). Psychologische
Achtergronden, 1950, 1(11/12), 98-107.

Raters  12

Psych. Abst., 26, #3635


Rau, L. Variability in response to words: An investigation of stimulus-
ambiguity. American Journal of Psychology, 1958, 71, 338-349.


Ray, W. S., Hundleby, J. D., & Goldstein, D. A. Test skewness and kurtosis
as functions of item parameters. Psychometrika, 1962, 27, 39-47.




Razran, G. H. S. A quantitative study of meaning by a conditioned salivary
technique (semantic conditioning). Science, 1939, 89-90.


Redl, F. On examination fear. New Era, 1936, ]J_, 73-75.


Reel, W. D. A study of the relationship between personal information and
social  skills  rating.  Dissertation  Abstracts.  1961,  ^(4),  1286. 


Rees,  D.  W.,  & Copeland,  N.  K.  The  effects  of  serial  position  in  checklist 
design.  USAF  WADC  Technical  Note  59-552,  1959. 

Check  List,  Instrument  Length,  Investigator  Error  3g,  13,  7 

Psych.  Abst.  , 3^,  #1369  A-H 

Reeves, J. W. What is occupational success? Postscript 1970. Occupation-
al Psychology, 1970, 219-220.


Reich,  J.,  & Sherif,  M.  Ego-involvement  as  a factor  in  attitude  assess- 
ment by  the  own  categories  technique.  Norman,  Oklahoma:  University  of 
Oklahoma,  Institute  of  Group  Relations,  1963.  (Mimeographed) 


Reiling, E., & Taylor, R. A new approach to the problem of changing
initial responses to multiple choice questions. Journal of Educational
Measurement, 1972, 9(1), 67-70.


Remmers,  H.  H.  The  validity,  reliability  and  halo  effect  of  human  judg- 
ments in  defined  situations.  Psychological  Bulletin.  1933,  30,  577. 

Investigator  Error,  Validity,  Reliability  12,  15 

Psychological  Bulletin.  30,  p.  577  (Rev.  from  rept.)  A-N 




Remmers, H. H. Introduction to opinion and attitude measurement. New
York: Harper, 1955.


Remmers, H. H., Gage, N. L., & Rummel, J. F. A practical introduction
to measurement and evaluation. New York: Harper and Row, 1965.

Textbook  16 

ORA  T-H 


Remmers,  H.  H.,  & Remmers,  E.  M.  The  negative  suggestion  effect  of  true- 
false  examination  questions.  Journal  of  Educational  Psychology,  1926,  17  , 
52-56. 

Remmers,  H.  H.,  & Silance,  E.  B.  Generalized  attitude  scales.  Journal 
of  Social  Psychology,  1934,  298-312. 


Remmers,  H.  H.,  et  al.  An  experimental  study  of  the  relative  difficulty  of 
true-false,  multiple  choice,  and  incomplete  sentence  types  of  examination 
questions.  Journal  of  Educational  Research,  1923,  1^,  366-371. 


Remmers, H. H., et al. Further studies in attitudes. Series III. Purdue
University Studies in Higher Education, 1938, 34, 1-151.

N/A  N/A 

N/A  T-M 

* Rennick, V. G., Grupe, J. E., Reich, E. L., & Sewell, M. R. Exploratory
study  of  rating  procedures  used  to  analyze  material  received  on  parents' 
reports.  Union  College  Stud.  Character  Res.,  1954,  101-124. 

Ranking,  Rating  Scales  2 

Psych.  Abst . , 29 , #3751  A-M 


Reuschling,  T.  L.,  & Etzel,  M.  J.  Disappearing  data  source.  Business 
Horizons , 1973,  1^,  17-22. 


Reuss, C. F. Differences between persons responding and not responding
to mail questionnaires. American Sociological Review, 1943, 8, 433-438.


* Reynolds,  W.  H.  Some  empirical  observations  on  a ten-point,  poor-to- 
excellent  scale.  Journal  of  Marketing  Research,  1966,  3,  388. 

Scaling,  Data  Analysis  2,  8 

Journal  of  Marketing  Research.  1966,  3,  p.  388  R-M 


Rice,  S.  A.  Contagious  bias  in  the  interview.  American  Journal  of 
Sociology,  1929,  35(3),  420-423. 


Richards,  E.  A.  A commercial  application  of  Guttman  attitude  scaling 
techniques.  Journal  of  Marketing,  1951,  166-173. 

Rating  Scales  17,  14 

ORA  R-N 




Richards,  P.  H.  Analysis  of  the  Delphi  Survey  1972-1973.  AOTi,  National 
Invitational  Conference  Redesigning  Teacher  Education  Pre-Conference 
Input,  1973. 


Richardson,  M.  W.  An  empirical  study  of  the  forced-choice  performance 
report.  American  Psychologist,  1949,  4,  278-279. 


Richardson, M. W., & Kuder, G. F. Making a rating scale that measures.
Personnel Journal, 1933, 12, 36-40.


* Richardson,  S.  A.  The  use  of  leading  questions  in  non-schedule  interviews. 
Human  Organization,  1960,  1^(2),  86-89. 

Clarity,  Interviews,  Investigator  Error  3g,  13 

Bureau  of  Census,  #7111006701  A-H 


Richardson,  Dohrenwend,  & Klein.  Interviewing,  its  forms  and  functions. 
New  York:  Basic  Books,  1965. 


Riker,  B.  L.  A comparison  of  methods  used  in  attitude  research.  Journal 
of  Abnormal  and  Social  Psychology,  1944,  24-42. 


Riker,  B.  L.  Comparison  of  attitude  scales-  a correction.  Journal  of 
Abnormal  and  Social  Psychology,  1945,  102-103. 


Riley,  J.  W. , Jr.,  & Toby,  J.  Sociological  studies  in  scale  analysis. 
New  Brunswick,  N.  J.:  Rutgers  University  Press,  1954. 


Riley, M. W. Sociological research. New York: Harcourt, Brace and World,
1963.




Riley, M. W., & Toby, J. Subject and object scales: a sociological appli-
cation. American Sociological Review, 1952, 17, 287-296.


Riley,  M.  W.,  et  al.  Sociological  studies  of  scale  analysis.  New  Bruns- 
wick: Rutgers  University  Press,  1954. 


Rimoldi, H. J. A., & Devans, J. R. Some considerations on scaling pro-
cedures. Perceptual and Motor Skills, 1960, 11, 207-213.


Rimoldi, H. J. A., & Hormaeche, M. The law of comparative judgment in
successive intervals and graphic rating scale methods. Psychometrika, 1955,
20, 307-318.


Rippey,  R.  M.  Scoring  and  analyzing  confidence  tests.  (ED  060  070.  MF 
and  HC  available  from  EDRS.) 


Robertson, D. W. Source documents for the automated enlisted performance
evaluation system. USN Personnel and Training, 1972. Research Memorandum
SRM-72-10.


Robertson, R. J., & Malchick, D. L. The reliability of global ratings
versus specific ratings. Journal of Clinical Psychology, 1968, 24(2),
256-258.


Robinson,  J.,  & Shaver,  P.  Measures  of  social  psychological  attitudes, 
(Appendix  B to  measures  of  political  attitudes).  Ann  Arbor:  Survey 
Research  Center,  Institute  for  Social  Research,  University  of  Michigan, 
1969. 


Robinson, J. P., Rusk, J. C., & Head, K. B. Measures of political atti-
tudes. Ann Arbor, Michigan: University of Michigan, Institute for Social
Research, 1968.




Robinson,  R.  A.  How  to  boost  returns  from  mail  surveys.  Printers'  Ink, 
1952,  ^(6),  36. 


Rodger,  A.  The  worthwhileness  of  the  interview.  Occupational  Psychology, 
London,  1952,  101-106. 

Interviews  1 

Psych.  Abst.  , #701  A-N 


Rodgers,  F.  A.  Test-item  format  preferences  of  elementary-school  pupils. 
Elementary  School  Journal,  1966,  ^(1),  45-49. 


Roeber, E. C. Comparison of seven interest inventories with respect to
word usage. Journal of Educational Research, 1948, 8-17.

Clarity, Interest Measures  4

ORA  R-N 


Roehr, G. A. Effective techniques in increasing response to mailed
questionnaires.  Public  Opinion  Quarterly,  1963,  299-302. 


Rogers, S. The anchoring of absolute judgments. Archives of Psychology,
New York, 1941, No. 261.

Rating  Scales,  Psychophysical  Measures  3f 

ORA  R-H 


Rohila, P., Shanhdhar, S. C., & Sharma, V. Comparison of a non-verbal
interest inventory with its verbal equivalent. Journal of Psychological
Researches, 1966, J^(1), 32-36.

Instrument Format, Clarity  3, 4

Psych. Abst., #7719  A-M


Rohrer, J. H. A crossvalidation of the proverbial attitudes test and the
psychodynamics underlying the proverbs identifying the character and person-
ality disorders. Washington, D. C.: Georgetown University, 1961. Contract
NONR 153007.

Personality Measures

DDC, #262 911


Rokeach, M. Attitude as a determinant of distortions in recall. Journal
of Abnormal and Social Psychology, 1952, 482-488.

Personality  Measures  18 

Psych.  Abst.  , 2J_,  #2533  A-N 


Rokeach, M. The double agreement phenomenon: Three hypotheses. Psychologi-
cal Review, 1963, 22, 304-309.


Rokeach, M. The role of values in public opinion research. Public
Opinion Quarterly, 1968-69, 32(4), 547-559.


Rokeach,  M.  The  nature  of  attitudes.  International  Encyclopedia  of  the 
Social  Sciences,  1972. 


Roman,  H.  S.  Semantic  generalization  in  formulation  of  consumer  attitudes. 
Journal  of  Marketing  Research.  1969,  369-73. 


Rommetveit,  R.,  & Svalheim,  R.  Some  halo  effects  in  perception  of  geo- 
metrical patterns.  Nordisk  Psykologi,  1959,  H.,  11-24. 

Response  Bias  12 

Psych.  Abst. , #535  A-N 


Roper, E. Wording of questions for the polls. Public Opinion Quarterly,
1940, 4, 129-130.

Question  Stem  3g 

ORA  R-N 

Roper, E. Checks to increase polling accuracy. Public Opinion Quarterly,
1941, 5, 87-90.

* Rorer, L. G. The great response-style myth. Psychological Bulletin, 1965,
63(3), 129-156.

Response  Bias,  Question  Stem  12,  3g 

Bureau  of  Census  A-M 


Rosander, A. C. The Spearman-Brown formula in attitude scale construction.
Journal of Experimental Psychology, 1936, j^, 486-495.


Rosander, A. C. An attitude scale based upon behavior situations.
Journal of Social Psychology, 1937, 8, 3-15.


Rose,  A.  M.  A research  note  on  experimentation  in  interviewing.  American 
Journal  of  Sociology,  1945,  ^(6),  509-586. 


Rosen,  H.,  & Rosen,  R.  A.  The  validity  of  "undecided"  answers  in  question- 
naire responses.  Journal  of  Applied  Psychology,  1955,  178-181. 

Investigator  Error  12 

Potter, Sharpe, Hendee, and Clark, 1972  A-M


* Rosen,  N.  A.  Anonymity  and  attitude  measurement.  Public  Opinion  Quarterly, 
1960,  24,  675-79. 


Anonymous Respondent  11

Potter, Sharpe, Hendee, and Clark, 1972  A-M




Rosenberg, M. J., Hovland, C. I., McGuire, W. J., Abelson, R. P., & Brehm,
J. W. Attitude organization and change. New Haven: Yale University Press,
1960.


Rosenthal, I., & Ferguson, T. S. An asymptotically distribution-free
multiple  comparison  method  with  application  to  the  problem  of  n rankings 
of  m objects.  British  Journal  of  Mathematical  and  Statistical  Psychology. 
1965,  18(2),  243-254. 

Ranking,  Data  Analysis  8 

Psych.  Abst. , #6090  A-N 


Rosenthal, R. Experimenter effects in behavioral research. New York:
Appleton-Century-Crofts, 1966.


Rosenthal,  R.,  Kermit,  L.  ..,  Freidman,  C.  J.,  & Vikan,  L.  L.  Subjects' 
perception  of  their  experimenter  under  conditions  of  experimenter  bias. 
Perceptual and Motor Skills, 1960, 11, 325-331.


Roskam,  E.  E.  The  method  of  triads  for  nonmetric  multidimensional  scaling. 
Netherlands  Journal  of  Psychology,  1970,  404-417. 


* Roslow, S., & Blankenship, A. B. Phrasing the question in consumer re-
search. Journal of Applied Psychology, 1939, 23(5), 612-622.

Questionnaire  Theory  and  Development,  Clarity,  3c,  4,  9, 

Question  Stem,  Instrument  Format,  Investigator  Error  12,  14,  17 

Potter,  Sharpe,  Hendee , & Clark,  1972  (Rev.)  R-M 


* Roslow, S., Wulfeck, H., & Corby, G. Consumer and opinion research: Experi-
mental studies on the form of the question. Journal of Applied Psychology,
1940, 24, 334-346.

Clarity,  Check  List,  Response  Alternatives,  2,  3g,  4,  3a 

Open-Ended  Items 

Journal of Applied Psychology, 24, pp. 345-346.  R-M




Rosnow, R. L., et al. More on the reactive effects of pretesting in
attitude research: Demand characteristics or subject commitment? Education-
al and Psychological Measurement, 1973, 33(1), 7-17.


Ross, I. Handling the neutral vote in product testing. Journal of Market-
ing Research, 1969, 6, 221-222.


* Ross, P. F. Reference groups in man-to-man job performance rating.
Personnel Psychology, 1966, 1^(2), 115-142.

Rating Scales, Raters  2

Psych. Abst., #11555 (Rev. from rept.)  R-M


Ross, R. T. Optimum orders for the presentation of pairs in the method
of paired comparisons. Journal of Educational Psychology, 1934, 375-382.

Paired Comparison Items, Response Alternatives  3b

ORA  R-N


* Ross, R. T. A linear relationship between paired comparisons and rank
order. Journal of Experimental Psychology, 1955, 50, 352-354.

Paired Comparison Items, Ranking

Psych. Abst., #6569


Rotter, G. S. Cardinal points of agreement and disagreement. Journal
of Social Psychology, 1972, ^(2).


Royer,  E.  B.  Some  recent  developments  in  test  construction.  Proceedings 
of  the  Oklahoma  Academy  of  Science,  136(16)  , 107-109. 




* Rozeboom, W. W., & Jones, L. V. The validity of the successive intervals
method of psychometric scaling. Psychometrika, 1956, 165-183.


Scaling  14 

Psych.  Abst.,  M,  #4039  A-M 


Rubin,  D.  B.  Matching  to  remove  bias  in  observational  studies  (AD716  441) 
(Fiche  S.R821970).  Cambridge,  Mass.:  Harvard  University,  Department  of 
Statistics,  1970. 

N/A  N/A 

N/A  T-H 


Ruch,  F.  L.  Effects  of  repeated  interviewing  on  the  respondent's  answers. 
Journal  of  Consulting  Psychology,  1941,  ^(4),  179-182. 


Ruch,  F.  L.  A technique  for  detecting  attempts  to  fake  performance  on  a 
self-inventory  type  of  personality  test.  In  McNemar,  W.,  & Merrill,  M.  A., 
Studies in Personality. New York: McGraw-Hill, 1942. Pp. 229-234.


Ruckmick, C. A. The uses and abuses of the questionnaire procedure.
Journal of Applied Psychology, 1932, 16(4), 32-41.


* Rugg,  D.  Experiments  in  wording  questions:  II.  Public  Opinion  Quarterly, 
1941,  5,  91-92. 

Investigator  Error  13,  3g 

ORA  R-M 


* Rugg,  D.  , & Cantril,  H.  The  wording  of  questions  in  public  opinion  polls. 
Journal  of  Abnormal  and  Social  Psychology,  1942,  469-495. 

Multiple  Choice  Items,  Forced  Choice  Items,  Open-Ended 

Items,  Instrument  Format,  Response  Bias  2,  3a,  3g,  12 

ORA  R-H 




Rundquist, E. A. Form of statement in personality measurement. Journal
of Educational Psychology, 1940, 31, 135-147.

Investigator Error, Personality Measures, Question Stem  3g

Journal of Educational Psychology, 31, pp. 146-147 (Rev.)  R-NA


Rundquist, E. A. The forced-choice technique and rating scales. Paper
presented at the American Psychological Association, Annual Meeting,
Philadelphia, Pennsylvania, 1946.

N/A  N/A

N/A  T-H


Rundquist,  E.  A.  Response  sets-a  note  on  consistency  in  taking  extreme 
positions.  Educational  and  Psychological  Measurement.  1950,  10(1),  97-99. 

Response  Bias  12 

ORA  R-H


Rundquist, E. A. Item and response characteristics in attitude and person-
ality assessment: A reaction to L. G. Rorer's "The great response-style
myth" (AD 6^6 772). Psychological Bulletin, 1966, 6^, 166-177.

Response Bias  12

DDC (Rev.)  A-N


Rundquist, E. A., Winer, B. J., & Falk, G. H. Follow-up validation of
forced-choice items of the Army Officer Efficiency Report. American Psych-
ologist, 1950, 5, 359.


Rupley, W. H. ERIC/RCS: The cloze procedure. Journal of Reading, 1973,
16(6), 496-502.




Sabers, D. L., & White, G. W. The effect of differential weighting of
individual  item  responses  on  predictive  validity  and  reliability  of  an 
aptitude  test.  Journal  of  Educational  Measurement.  1969,  6(2),  93-96. 


Sadacca, R. Dimensions of response consistency in paired comparisons.
Research Bulletin 62-14. Princeton, New Jersey: Educational Testing Service,
1962.

N/A  N/A

N/A  T-H


* Saffir, M. A. A comparative study of scales constructed by three psycho-
physical methods. Psychometrika, 1937, 2, 179-198.

Scaling, Validity, Psychophysical Measures  2, 14

Psychometrika, 2, p. 179 (Rev. from rept.)  R-H


Salas, R. G. A scale of satisfaction with Army life. Australian Military
Forces Research Report No. 8-67, III, 1967.

N/A  N/A

N/A  T-M

Saltz, E., Reece, M., & Ager, J. Studies of forced-choice methodology:
Individual differences in social desirability. Educational and Psychologi-
cal Measurement, 1962, 22(2), 365-370.


Samejima, F. A general model for free response data. Psychometrika,
1972, 37(1), 1-68.


Samuelson, F. Agreement set and anticontent attitudes in the F scale:
A reinterpretation. Journal of Abnormal and Social Psychology, 1964, 68,
338-42.


Santostefano, S. Forced-choice acts as personality measures. Dissertation
Abstracts, 1958, 18, 287-288.


Sattler,  J.  M.  Racial  experimenter  effects  in  experimentation,  testing, 
interviewing  and  psychotherapy.  Psychological  Bulletin,  1970,  73(2) , 
137-160. 

Investigator  Error  9 

Bauman,  Rogers,  and  Weiss,  1971  A-M 


Saunders,  C.,  & Ward,  G.  A comparison  of  differing  numbers  of  alternatives 
in  two  bipolar  scales.  Proceedings  of  the  West  Virginia  Academy  of  Science, 
1964,  No.  36,  187-190. 

Response  Alternatives,  Multiple  Choice  Items,  2,  3a 

Reliability 

Psych. Abst., #8271  A-H


Saupe,  J.  L.  An  empirical  model  for  the  corroboration  of  suspected  cheat- 
ing on  multiple-choice  tests.  Educational  and  Psychological  Measurement, 
1960,  475-489. 

Response  Bias,  Investigator  Error  12 

Psych . Abst . , 35 , #3761  A-N 


Sawyer,  H.  G.  How  to  get  at  the  truth  with  market  surveys  by  mail. 
Sales  Management,  1959  , 98. 




Sax, G., & Carr, A. An investigation of response sets on altered parallel
forms. Educational and Psychological Measurement, 1962, 22(2), 371-376.

Response Bias, Instrument Format, Respondent's Motivation  12, 3c, 11

Psych. Abst., 37, #3267  A-H


Scates, D. E., & Yoemans, A. V. Developing an objective item questionnaire
to assess the market for further education among employed adults. Washing-
ton, D. C.: American Council on Education, 1950(a).

Open-ended  Items  1,  5 

Psych.  Abst.  , 2^,  #545  A-H 


Scates,  D.  E.,  & Yoemans,  A.  V.  The  effect  of  questionnaire  form  on  course 
requests  of  employed  adults.  Washington,  D.  C.:  American  Council  on  Educa- 
tion, Research  Staff  on  Scientific  Personnel,  1950(b). 

Preference  Measures,  Check  List,  Open-Ended  Items,  1,  2,  3,  7, 

Military  Personnel,  Instrument  Format  14 

Psych. Abst., 26, #228 (Rev. from rept.)  R-M


Schaie,  K.  W.  On  the  equivalence  of  questionnaire  and  rating  data.  Psych- 
ological Reports,  1962,  lJ^(2) , 521-522. 

Response  Bias,  Data  Analysis  14,  12,  9 

Psych.  Abst. , 2Z>  #3160  A-N 


Schaie,  K.  W.  Scaling  the  scales:  Use  of  expert  judgment  in  improving 
the  validity  of  questionnaire  scales.  Journal  of  Consulting  Psychology, 
1963,  27(4)  , 350-357. 

Validity, Scoring  14

Psych. Abst., 38, #2713  A-M


Scheffe, H. An analysis of variance for paired comparisons. Journal of
the American Statistical Association, 1952, 47, 381-400.

Question  Stem,  Paired  Comparison  Items  3c,  2 

Psych.  Abst.  . ll,  #3148  A-N 


Schendel, D. E., Wilkie, W. L., & McCann, J. M. An experimental investiga-
tion of "Attribute Importance." In Gardner (Ed.), Proceedings of the Second
Annual Conference of the Association for Consumer Research, College Park,
Md., 1971, 243-255.


Schlinger,  M.  J.  Cues  or  Q-technique.  Journal  of  Advertising  Research, 
1969,  9(3),  53-60. 

Card  Sorts,  Questionnaire  Theory  and  Development  R-N 

ORA 


Schlinger,  M.  J.  Responses  to  advertising-varieties  of  liking  and  dis- 
liking. Journalism  Quarterly,  1970,  ^(1),  31-40. 


Schmeiser,  C.  B.,  & Whitney,  D.  R.  The  effect  of  selected  poor  item- 
writing  practices  on  test  difficulty,  reliability,  and  validity:  A replica- 
tion.  Paper  presented  at  the  American  Educational  Research  Association 
meeting.  New  Orleans,  Louisiana,  1973. 


Schmidt,  C.  Semantic  problems  of  construction  and  analysis  of  a scientific 
questionnaire.  Bulletin  du  Centre  d'Etudes  et  Recherches  Psychotechniques , 
1967,  16(3),  265-271. 


Schmidt, H. D. Über die Zuverlässigkeit von Verhaltensbeurteilungen durch
Rating Skalen. (Reliability of behavior through rating scales.) Archiv für
die gesamte Psychologie, 1966, 118(1-2), 47-72.


Schooler,  C.  A note  of  extreme  caution  on  the  use  of  Guttman  scales. 
American  Journal  of  Sociology,  1968,  74(3),  296-301. 




Schubert, S. P., & Fiske, D. W. Increase of item response consistency
by prior item response. Educational and Psychological Measurement, 1973,
33(1), 113-121.


Schuessler, K. F. Item selection in scale analysis. American Sociologi-
cal Review, 1952, 17, 183-192.

Investigator Error, Scoring  8, 3g, 13

Psych. Abst., 1]_, #4751 (Rev.)  A-H


Schulman, G. I. Asch conformity studies-conformity to experimenter and/or
to the group? Sociometry, 1967, ^(1), 26-40.


Schultz,  C.  B.  Response  set  factors  revealed  by  factor  analysis  of  an 
unconfounded  item  pool.  Seattle,  Wash.:  University  of  Washington,  1962. 

Response  Bias,  Personality  Measures  12 

ORA  R-M 


Schultz,  D.  G.  Item  validity  and  response  change  under  two  different 
testing  conditions.  Journal  of  Educational  Psychology.  1954,  45,  36-43. 


Schuman,  H.  The  random  probe:  A technique  for  evaluating  the  validity  of 
closed  questions.  American  Sociological  Review,  1966,  ^(2),  218-222. 

Clarity,  Forced  Choice  Items  4 

Bureau  of  Census,  #710000801  A-H 


Schuman, H. Attitudes vs. actions versus attitudes vs. attitudes. Public
Opinion Quarterly, 1972, 36(3), 347-354.


Schuman, H. Effects of survey question wording on survey results. Ann
Arbor, Mich.: University of Michigan, School of Arts, June 1974, Contract
GS-39780.

Question  Stem  3g 

ORA  A-H 


Schuman, H., & Converse, J. M. The effects of black and white interviewers
on black responses in 1968. Public Opinion Quarterly, 1971, 35(1), 44-68.

Investigator Error  9

Bauman,  Rogers,  and  Weiss,  1971  (Rev.)  A-M 


Schutz, R. E., & Foster, R. J. A factor analytic study of acquiescent
and extreme response set. Educational and Psychological Measurement, 1963,
23, 435-447.


Schwartz, S. H., & Tessler, R. C. A test for a model for reducing measured
attitude-behavior discrepancies. Journal of Personality and Social Psychol-
ogy, 1972, ^(2), 225-236.


Schyberger, B. W. Study of interviewer behavior. Journal of Marketing
Research, 1967, 4, 32-35.

Interviews, Investigator Error  9, 12, 13

ORA  R-M

Schyberger, B. W. A case against direct questions on reading habits.
Journal of Advertising Research, 6(4), 25-29.

N/A  18

ORA  A-NA

Scott,  C.  Research  on  mail  surveys.  Journal  of  the  Royal  Statistical 
Society,  XXIV.  Series  A,  General,  1961,  143-195. 




* Scott, W. A. Comparative validities of forced-choice and single-stimulus
tests. Psychological Bulletin, 1968, 70(4), 231-244.

Response Alternatives, Forced Choice Items  2, 3a

Psych. Abst., 44, #00117  A-H


Sears, D. O., & Abeles, R. P. Attitudes and opinions. Annual Review of
Psychology, 1969, 253-288.

Attitude Measures, Literature Review  16

ORA  R-N


Sears,  R.  R.  Comparison  of  interviews  with  questionnaires  for  measuring 
mothers'  attitudes  toward  sex  and  aggression.  Journal  of  Personality  and 
Social  Psychology,  1965,  ^(1),  37-44. 


* Seashore,  R.  H.,  & Hevner,  K.  A time-saving  device  for  the  construction  of 
attitude  scales.  Journal  of  Social  Psychology,  1933,  4,  366-372. 


Card Sorts, Scaling, Rating Scales  2, 14

ORA  R-H


Secrist,  G.  51.  Assessment  of  motivational,  attitudinal,  and  satisfaction 
factors  related  to  performance  in  Air  Force  technical  training.  Lackland 
AFB, Texas: Air Force Human Resources Laboratories, June 1972.

Military Personnel  18

ORA  A-NA


Sedlacek,  W.  E.,  & Brooks,  G.  C.  Race  as  an  experimenter  effect  in  racial 
attitude  measurement  (ERIC  Document  Reproduction  Service,  ED  065  525). 
College  Park,  Md.:  Maryland  University,  Cultural  Study  Center,  1972. 


Investigator Error  9

ERIC Document Reproduction Service, ED 065 525  A-M




Seeley,  L.  C.,  Morton,  M.  A.,  & Anderson,  A.  A.  Exploratory  study  of  a 
Sequential  Item  Test.  USA  PRO  OCRD  Technical  Research  Note  No.  129,  1962. 

Military  Personnel,  Instrument  Format,  Clarity,  14,  8,  3c, 

Instrument  Length,  Scoring  7,  5 

Psych. Abst., 38, #945  A-M


Seeman,  W.  "Subtlety"  in  structured  tests.  Journal  of  Consulting  Psychol- 
ogy, 1952,  16,  278-283. 


Sell,  T.,  et  al.  Research  methods  in  social  relations.  New  York:  Holt, 
Rinehart,  1959. 


Sellitz,  C.,  & Cook,  S.  Racial  attitude  as  a determinant  of  judgments  of 
plausibility  (AD  653  827).  Boulder,  Colo.:  University  of  Colorado,  Institute 
of  Behavioral  Science,  1965.  Contract  AF-AFOSR-436-63. 


Sellitz, C., Jahoda, M., Deutsch, M., & Cook, S. W. (Eds.) Research
Methods in Social Relations. New York: Holt, 1959.


Sessions,  F.  Q.,  et  al.  The  development,  reliability  and  validity  of  an 
all  purpose  optical  scanner  questionnaire  form.  Public  Opinion  Quarterly, 
1966,  30,  423-428. 

Instrument Format  3g

Bureau of Census, #7111034901  A-H


* Sgan, M. L. Social reinforcement, socioeconomic status and susceptibility
to experimenter influence. Journal of Personality and Social Psychology,
1967, 5(2), 202-210.

Investigator Error, Respondent's Motivation  9, 10, 11

ORA  A-H


Shaffer, L. F. Fear and courage in aerial combat. Journal of Consulting
Psychology, 1947, 11, 137-143.




Shapiro,  M.  J.  Discovering  interviewer  bias  in  open-ended  survey  responses. 
Public  Opinion  Quarterly,  1970,  34(3),  412-415. 


Shapiro,  S.,  & Eberhart,  J.  C.  Interviewer  differences  in  an  intensive 
interview  survey.  International  Journal  of  Opinion  and  Attitude  Research, 
1947,  1(2),  1-17. 

Interviews,  Investigator  Error  13 

Bureau  of  the  Census,  #7110001801  (Rev.  from  rept.)  R-H 


Sharon,  A.  T.  The  effect  of  instructional  conditions  in  producing  leniency 
on  two  types  of  rating  scales.  Dissertation  Abstracts.  1969,  ^(8-B) , 3124. 

Investigator  Error  7,  13 

ORA  A-M 


Sharp, H. The mail questionnaire as a supplement to the personal interview.
American Sociological Review, 1955, 20(6), 718.

Interviews  1

Psych. Abst., 31, #2753  A-N


Shaw, M. E., Worthy, M., & Blum, J. M. Effects of number of judges upon
scale values in the analysis of small group tasks. Gainesville, Florida:
Office of Naval Research, 1963. Technical Report No. 2.

N/A  N/A

N/A  T-M

Shaw,  M.  E.,  & Wright,  J.  M.  Scales  for  the  measurement  of  attitudes. 
New  York:  McGraw-Hill,  1967. 


Textbook  16


Sheatsley, P. B. Closed questions sometimes more valid than open. Public
Opinion Quarterly, 12(1), 127.

Open-Ended Items  2,

ORA  R-M


Sheatsley, P. B. The influence of sub-questions on interviewers' perform-
ance. Public Opinion Quarterly, 1949, 13(2), 310-313.


Sheatsley, P. B. The harassed respondent: II. Interviewing practices.
In Bogart, L. (Ed.), [title illegible] research. Chicago,
Ill.: Markham Publishing Co., 1969 (a). Pp. [illegible].


Sheatsley, P. B. Questionnaire design and question wording. Talk prepared
for American Newspaper Publishers Association Research Seminar, Columbus,
Ohio, July, 1969 (b).


Shellhaas, M. Motion pictures for [illegible] presentation: Development and
uses for opinion-attitude research interview. Psychological Reports,
1968, [volume and pages illegible].


* Shen,  E.  The  influence  of  friendship  upon  personal  ratings.  Journal  of
Applied  Psychology,  1925,  9,  66-68.

Ranking,  Response  Bias,  Investigator  Error  7,  10,  12,  13

ORA  R-M


Shenck,  R.  L.,  & Goodman,  C.  Reactions  to  propaganda  on  both  sides  of  a
controversial  issue.  Public  Opinion  Quarterly,  1939,  3(1),  107-112.


Shepard,  R.  N.  The  analysis  of  proximities:  Multidimensional  scaling  with
an  unknown  distance  function.  I.  Psychometrika,  1962  (a),  125-140.


Shepard,  R.  N.  The  analysis  of  proximities:  Multidimensional  scaling  with
an  unknown  distance  function.  II.  Psychometrika,  1962  (b),  219-246.


Sheppard,  D.  The  adequacy  of  everyday  quantitative  expressions  as
measurements  of  qualities.  British  Journal  of  Psychology,  1954,  45,
40-50.


Rating  Scales,  Response  Alternatives 
Psych.  Abst.,  28,  #6763  (Rev.  from  rept.)


Sherif,  C.  W.  Established  reference  scales  and  series  effects  in  social 
judgment.  Dissertation  Abstracts,  ^(6),  2083. 


Response  Bias,  Response  Alternatives 


3f,  12 


ORA 


A-H 


Sherif,  C.  W.,  Sherif,  M.,  & Nebergall,  R.  E.  Attitude  and  attitude
change.  Philadelphia:  W.  B.  Saunders,  1965.


Sherif,  M.  A study  of  some  social  factors  in  perception.  Archives  of
Psychology.  New  York,  1935,  No.  187.


Sherif,  M.,  & Hovland,  C.  I.  Judgmental  phenomena  and  scales  of  attitude
measurement:  Placement  of  items  with  individual  choice  of  number  of  cate- 
gories. Journal  of  Abnormal  and  Social  Psychology,  1953,  135-141. 

Attitude  Measure,  Respondent's  Motivation,  9,  11,  12 

Response  Bias,  Card  Sorts 

Psych.  Abst.,  28,  #732  (Rev.  from  rept.)  R-M 


Sherif,  M.,  Taub,  D.,  & Hovland,  C.  I.  Assimilation  and  contrast  effects
of  anchoring  stimuli  on  judgments.  Journal  of  Experimental  Psychology, 
1958,  150-155. 


Investigator  Error,  Raters 


Response  Alternatives 


3f 


Shipley,  W.  C.,  Coffin,  J.  I.,  & Hadsell,  K.  C.  Affective  distance  and
other  factors  determining  reaction  time  in  judgments  of  color  preference. 
Journal  of  Experimental  Psychology,  1945,  ^(3),  206-215. 


* Shipley,  W.  C.,  Norris,  E.  D. , & Roberts,  M.  L.  The  effect  of  changed 
polarity  of  set  on  decision  time  of  affective  judgments.  Journal  of  Ex- 
perimental Psychology,  1946,  ^(3),  237-243. 

Paired  Comparison  Items,  Response  Bias  3g,  12 

ORA  R-M 


Shoemaker,  D.  M.  An  application  of  item-examinee  sampling  to  scaling 
attitudes  (ERIC  Document  Reproduction  Service,  ED  060  026).  Paper
presented  at  the  Annual  Meeting  of  the  American  Research  Association, 
Chicago,  1972. 


Shor,  J.  Report  on  a verbal  projective  technique.  Journal  of  Clinical
Psychology,  1946,  2,  279-282.


Shukla,  A.  N.,  Sohal,  T.  S.,  & Gupta,  J.  P.  A similitudinal  study  of
Gausset  t-test,  Edwards  25-D,  t HL-test,  Edward  and  Kilpatrick's  scale  dis- 
crimination r-phi,  bi-serial  coefficient,  point  biserial  coefficient  and 
Guilford's  phi  coefficient  for  item  analysis  in  the  construction  of  summated 
rating  scale.  Indian  Journal  of  Psychology,  1971,  ^(4),  329-340. 


Shulson,  V.,  & Crawford,  C.  C.  Experimental  comparison  of  true-false  and 
completion  tests.  Journal  of  Educational  Psychology,  1928,  19,  580-583.


Shuttleworth,  F.  K.  A study  of  questionnaire  technique.  Journal  of 
Educational  Psychology,  1931,  ^(9),  652-658. 


* Sicinski,  A.  Don't  know  answers  in  cross  national  surveys.  Public  Opinion 
Quarterly,  1970,  34,  126-129.

Response  Bias  9,  12 


ORA 


R-M 


Sieber,  S.  D.  The  integration  of  fieldwork  and  survey  methods.  American 
Journal  of  Sociology,  1973,  78,  1335-1359.


* Siegel,  A.  I.,  & Schultz,  D.  G.  Generalized  Thurstone  and  Guttman  scales
for  measuring  technical  skills  in  job  performance.  Journal  of  Applied
Psychology,  1961,  137-142.


* Siegel,  A.  I.,  & Schultz,  D.  G.  Thurstone  and  Guttman  scaling  of  job
related  technical  skills.  Psychological  Reports,  1962,  1^,  855-861. 

Scaling,  Check  List

Psych.  Abst.,  37,  #5770  A-M 


* Siegel,  A.  I.,  Schultz,  D.  G.,  & Benson,  S.  Post-training  performance 
criterion  development  and  application:  A further  study  into  technical
performance  check  list  criteria  which  meet  the  Thurstone  and  Guttman
scalability  requirements.  Wayne,  Pa.:  Applied  Psychological  Services,  1960.

Scaling 

Psych.  Abst.,  35,  #1325  A-N 


Siegel,  A.  I.,  Schultz,  D.  G.,  & Fischl,  M.  A.  Absolute  scaling  of  job 
performance.  Journal  of  Applied  Psychology,  1968,  52(4),  313-318.


* Siegel,  L.  C.,  & Siegel,  L.  Item  sorts  versus  graphic  procedure  for  ob- 
taining Thurstone  Scale  judgments.  Journal  of  Applied  Psychology,  1962, 
57-61. 

Scaling,  Card  Sorts,  Rating  Scales  1,  2 

Psych.  Abst.,  36,  #5GD57S  A-H


Siegel,  S.  Nonparametric  statistics  for  the  behavioral  sciences.  New  York:
McGraw-Hill,  1956. 


Siegman,  A.  W.,  & Pope,  B.  Ambiguity  and  verbal  behavior  in  the  initial 
interview.  Proceedings  of  the  76th  Annual  Convention  of  the  American  Psych- 
ological Association,  1968,  521-522. 


* Siegman,  A.  W.,  Pope,  B.,  & Blass,  T.  Effects  of  interviewer  status  and
direction  of  interviewer  messages  on  interviewee  productivity.  Proceedings 
of  the  77th  Annual  Meeting,  American  Psychological  Association,  1969,  4 
(pt.2),  541-542. 

Investigator  Error  13 

Bauman,  Rogers,  and  Weiss,  1971  A-M 


Siegmann,  P.  J.  A comparison  of  factor  analysis  with  Guttman's  scaling 
technique.  Dissertation  Abstracts,  1960,  M,  3368-3369. 


Siller,  J.,  & Chipman,  A.  Response  set  paralysis:  Implications  for
measurement  and  control.  Journal  of  Consulting  Psychology,  1963,  27,  432-438.

Response  Bias  12 

Journal  of  Consulting  Psychology,  27,  p.  432.  R-H


Silverman,  F.  H.  Intraclass  correlation  coefficient  as  an  index  of  relia- 
bility of  median  scale  values  for  sets  of  stimuli  rated  by  equal-appearing 
intervals.  Perceptual  and  Motor  Skills.  1968,  ^(3,  Pt.  1),  878. 


Silverman,  F.  H.  Interpretation  of  the  correlations  between  sets  of  scale 
values  and  stimuli  which  have  been  rated  for  more  than  a single  attribute. 
Perceptual  and  Motor  Skills,  1971,  ^(2),  667-669. 


* Silverman,  R.  E.  The  Edwards  Personal  Preference  Schedule  and  social
desirability.  Journal  of  Consulting  Psychology,  1957,  402-404.


Response  Bias,  Questionnaire  Theory  and  Development 
Psych.  Abst.,  33,  #1317


12,  14 


Silverstein,  A.,  & Dienstbier,  R.  A.  Rated  pleasantness  and  association
value  of  101  English  nouns.  Journal  of  Verbal  Learning  and  Verbal  Behavior,
1968,  7(1),  81-86.


Simon,  R.  Responses  to  personal  and  form  letters  in  mail  surveys.  Journal 
of  Advertising  Research,  1967,  7(1),  28-30.


Respondent's  Motivation,  Anonymous  Respondent 


11,  15 


ORA 


R-H 


Simpson,  R.  H.  The  specific  meanings  of  certain  terms  indicating  different 
degrees  of  frequency.  Quarterly  Journal  of  Speech,  1944,  30,  328-330. 

Investigator  Error,  Clarity  6,

ORA  R-H 


Sims,  V.  M.  An  evaluation  of  five-,  ten-,  and  fifteen-item  rearrangement 
tests.  Journal  of  Educational  Psychology,  1934,  25,  251-257. 

Achievement  Measures,  Instrument  Format,  Instrument  1,  5 

Length,  Rearrangement  Items,  Response  Alternatives 

ORA  R-N 


Sims,  V.  M.,  & Knox,  L.  B.  The  reliability  and  validity  of  multiple-
response  tests  when  presented  orally.  Journal  of  Educational  Psychology, 
1932,  23,  656-662. 


Sisson,  E.  D.  Forced  choice-the  new  Army  rating.  Personnel  Psychology, 
1948,  1,  365-381.


Sjoberg,  G.  A questionnaire  on  questionnaires.  Public  Opinion  Quarterly,
1954,  18,  423-427.


1 


* Sjoberg,  L.  A study  of  four  methods  for  scaling  paired  comparisons  data, 
Scandinavian  Journal  of  Psychology,  1965,  ^(3),  173-185.

Scaling,  Data  Analysis  14 

Psych.  Abst.,  40,  #2094  A-M


Skager,  R.  M.,  Bussis,  A.  M.,  & Schultz,  C.  B.  Comparison  of  information
scales  and  like-indifferent-dislike  scales  as  measures  of  interest.  Psych- 
ological Reports,  1965,  _1^,  251-61. 

Interest  Measures,  Achievement  Measures  18


Skelley,  P.  R.  Interviewer-appearance  stereotypes  as  a possible  source 
of  bias.  Journal  of  Marketing.  1954,  J^(l) , 74-75. 


Skindrud,  K.  D.  An  evaluation  of  observer  bias  in  experimental  field 
studies  of  social  interaction.  Final  Report  (ERIC  Document  Reproduction 
Service,  ED  072  105).  Eugene,  Ore.:  Oregon  Research  Institute,  1972. 

Raters  13 


ERIC  Document  Reproduction  Service,  ED  072  105 


* Slater,  P.  The  test-retest  reliability  of  some  methods  of  multiple  com- 
parison. British  Journal  of  Mathematical  and  Statistical  Psychology,  1965, 
18(2),  227-242. 


Ranking,  Paired  Comparison  Items,  Response  Alternatives, 
Reliability 

Psych.  Abst. , #6091 


2,  3a 


Sletto,  R.  F.  Construction  of  personality  scales  by  the  criterion  of
internal  consistency.  New  York:  Sociological  Press,  1937. 


* Sletto,  R.  F.  Pretesting  of  questionnaires.  American  Sociological
Review,  1940,  193-200.

Attitude  Measures,  Instrument  Format,  Instrument  3c,  3g,  5,  7, 

Length,  Respondent's  Motivation  11,  14,  4 

Potter,  Sharpe,  Hendee,  & Clark,  1972  (Rev.  from  rept.)  R-H


* Slocum,  W.  L.,  Empey,  L.  T.,  & Swanson,  H.  S.  Increasing  response  to 

questionnaires  and  structured  interviews.  American  Sociological  Review,  1956, 
21(2),  221-225. 

Respondent's  Motivation  11 

Bureau  of  Census,  #7111003501  A-M 


Slonim,  M.  J.  Sampling.  New  York:  Simon  and  Schuster,  1966.


* Small,  D.  O.,  & Campbell,  D.  T.  The  effect  of  acquiescence  response-set
upon  the  relationship  of  the  F Scale  and  conformity.  Sociometry,  1960,  23,
69-70.

Response  Bias  12 

Psych.  Abst.  , #2093  A-M 


* Smith,  D.  H.  Correcting  for  social  desirability  response  sets  in  opinion- 
attitude  survey  research.  Public  Opinion  Quarterly,  1967,  31(1),  87-94. 

Data  Analysis,  Response  Bias,  Scoring  8,  12 

ORA  R-H 


Smith,  E.  M.,  & Mason,  J.  R.  The  influence  of  instructions  on  respondent 
error.  Journal  of  Marketing  Research,  1970,  7(2),  254-255.

N/A  18 

ORA  R-NA 


Smith,  F.  F.  Direct  validations  of  questionnaire  data.  Educational 
Administration  and  Supervision.  1935,  H,  561-575. 


Smith,  F.  F.  The  relation  between  objectivity  and  validity  in  the
arrangement  of  items  in  rank  order.  Journal  of  Applied  Psychology,  1936,  20,
154-160.

Investigator  Error,  Preference  Measures,  Ranking,  10,  12,  13, 

Raters,  Reliability,  Response  Bias,  Validity  14 

ORA


Smith,  G.  H.  Motivation  research  in  advertising  and  marketing.  New  York: 
McGraw-Hill,  1954. 


Smith,  H.  L.  A critique  of  ipsative  measures  with  special  reference  to 
the  Navy  Activities  Preference  Blank.  USN  PRA  Technical  Bulletin  No. 
65-16. 

Literature  Review,  Investigator  Error,  Questionnaire  16,  8,  14 

Theory  and  Development 

Psych.  Abst.,  #15270  A-M


Smith,  H.  L.,  & Hyman,  H.  The  biasing  effect  of  interviewer  expectations 
on  survey  results.  Public  Opinion  Quarterly,  1950-51,  14(3),  491-506.

Interviews,  Investigator  Error  12,  13,  10,  9 

Psych.  Abst.,  26,  #2110  (Rev.  from  rept.)  R-H


Smith,  K.  An  investigation  of  the  use  of  "double-choice"  items  in  testing
achievement.  Journal  of  Educational  Research,  1958,  387-389. 

2,  3a,  10 


Reliability,  Multiple  Choice  Items 
Psych.  Abst. , 3^,  #6925 


A-N 


* Smith,  P.  C.,  & Kendall,  L.  M.  Retranslation  of  expectations:  An  approach
to  the  construction  of  unambiguous  anchors  for  rating  scales.  Journal  of 
Applied  Psychology,  1963,  ^(2),  149-155. 


Rating  Scales,  Reliability 
Psych.  Abst.,  37,  #7981


3f,  8 


Smith,  P.  C.,  Kendall,  L.  M.,  & Hulin,  C.  L.  The  measurement  of
satisfaction  in  work  and  retirement:  A strategy  for  the  study  of  attitudes.
Chicago:  Rand  McNally,  1969.


Smith,  R.  G.,  & Nichols,  H.  J.  Semantic  differential  stability  as  a 
function  of  meaning  domain.  Journal  of  Communication,  1973,  23(1),  64-73.


* Solomon,  A.  The  effect  of  answer  sheet  format  on  test  performance  by 
culturally  disadvantaged  fourth  grade  elementary  school  pupils.  Journal 
of  Educational  Measurement,  1971,  8,  289-290. 

Instrument  Format  ^8 


* Soueif,  M.  I.  Extreme  response  sets  as  a measure  of  intolerance  of  am- 
biguity. British  Journal  of  Psychology,  1958,  329-334. 


Rating  Scales,  Response  Bias 
Psych.  Abst.,  33,  #9993


10,  12 


Souren,  G.  Het  Rosenthal  effect  (the  Rosenthal  Effect)  (AD  870  528).
Netherlands:  Leiden  Rijksuniversiteit,  Psychologisch  Instituut,  1969. 
Report  No.  SP-002-69. 


Spaltro,  E.  The  semantic  differential  as  a method  to  measure  attitudes, 
Contributi  dell'Istituto  di  Psicologia,  1967,  166-235.


Speak,  M.  Some  characteristics  of  respondents,  partial-respondents,  and 
non-respondents  to  questionnaires  on  job  satisfaction.  Occupational 
Psychology,  1964,  38(3-4),  173-182.


* Speak,  M.  Communication  failure  in  questioning:  Errors,  misinterpretations,
and  personal  frames  of  reference.  Occupational  Psychology,  1967,  41(4),
169-181.


Investigator  Error,  Interviews  4 

ORA  R-M 


Spector,  A.  J.  Influences  on  merit  ratings.  Journal  of  Applied  Psychology,
1954,  38,  393-396.

Investigator  Error  13 

Psych.  Abst.  , #6336  A-M 


Spector,  A.  J.  Forced-choice  and  projective  techniques  in  attitude
measurement  (AD  134  221).  Maxwell  AFB,  Ala.:  Officer  Education  Research
Laboratory,  1957  (a).

N/A  N/A 

N/A  R-NA 


* Spector,  A.  J.  The  user's  role  in  constructing  a human  relations  test. 
Personnel  Psychology,  1957  (b),  10,  145-156.

Attitude  Measures,  Questionnaire  Theory  and  8,  10,  12 

Development

Psych.  Abst.,  32,  #2305  A-M


Spiritas,  A.  A.,  & Holmes,  D.  S.  Effects  of  models  on  interview  responses. 
Journal  of  Counseling  Psychology.  1971,  j^(3),  217-220. 


Staats,  A.  W.  Names  as  reinforcers:  The  social  value  of  verbal  stimuli
(AD  719  414).  Honolulu,  Hawaii:  Hawaii  University,  1970.  Report  No.  TR-9. 


N/A 

N/A 

N/A 

T-M 

Stagner,  R.  The  cross-out  technique  as  a method  in  public  opinion  analysis. 
Journal  of  Social  Psychology,  1940,  11,  79-90.

X-O  Test  Items  2

ORA  R-H 


Stagner,  R.,  & Osgood,  C.  E.  Impact  of  war  on  a nationalistic  frame  of 
reference:  I.  Changes  in  general  approval  and  qualitative  patterning  of 
certain  stereotypes.  Journal  of  Social  Psychology,  1946,  U,  187-215. 


Stalnaker,  J.  M.,  et  al.  Construction  and  application  of  psychological
tests  in  the  armed  services.  Review  of  Educational  Research,  1944,  14,
102-109.


* Stangenberg,  C.  New  definitions  of  scale-types.  Theoria:  A Swedish 
Journal  of  Philosophy,  1966,  ^(1),  56-61. 

Scaling  14 

Psych.  Abst.,  40,  #12708  A-H


Stanley,  J.  C.  A comparison  of  verbal  and  pictorial  self-rating  scale 
categories.  Journal  of  Experimental  Education.  1955,  239-246. 

3g 


Question  Stem 

Psych.  Abst.  , 30,  #1918 


A-H 


Stanley,  J.  C.,  & Wang,  M.  D.  Differential  weighting:  A survey  of  methods 
and  empirical  studies  (ERIC  Document  Reproduction  Service,  ED  030  148). 
Baltimore,  Maryland:  Johns  Hopkins  University,  Center  for  the  Study  of
Social  Organization  of  Schools,  1968. 

Scoring  18 

ERIC  Document  Reproduction  Service,  ED  030  148  A-N 


Stanton,  F.  Notes  on  the  validity  of  mail  questionnaire  returns.  Journal 
of  Applied  Psychology,  1939,  23,  95-104.

Stanton,  F.,  & Baker,  K.  H.  Interviewing  bias  and  the  recall  of
incompletely  learned  materials.  Sociometry,  1942,  123-134.

Interviews,  Investigator  Error  12 

ORA  from  summary  in  Psych.  Abst.  , #832  A-M 


Stanton,  H.  E.  Rating  or  inventory:  A comparison  of  two  approaches  to 

personality  measurement.  Australian  Psychologist,  1972,  33-39.


Starry,  A.  R.,  & others.  Stability  ratings  as  classifiers  of  life  history 
item  retest  reliability.  Journal  of  Applied  Psychology,  1969,  _53(1),  14-18. 


Stary,  D.  Does  the  modified  method  of  forced  choice  eliminate  the  rater's 
bias.  Revija  za  Psihologiju,  1970,  1^(1),  19-22.

N/A  N/A 

N/A  T-H 


Staugas,  L.,  & McQuitty,  L.  L.  A new  application  of  forced-choice  ratings. 
Personnel  Psychology,  1950,  3,  413-424. 


2,  14 


Forced  Choice  Items,  Rating  Scale 
Psych.  Abst. , 25 , #3985  (Rev.  from  rept.) 


R-M 


Stebbins,  R.  A.  The  unstructured  research  interview  as  incipient  inter- 
personal relationship.  Sociology  and  Social  Research.  1972,  ^(2),  164-179. 


Steele,  H.  L.  On  the  validity  of  projective  questions.  Journal  of 
Marketing  Research,  1964,  46-49. 

Attitude  Measures,  Interviews,  Open-ended  Items,  1,  2,  10,  3g

Validity,  Projective  Items 

ORA  R-H


Stefflre,  B.  The  reading  difficulty  of  interest  inventories.  Occupations,
1947,  95-96.


Steinbock,  C.  B.  A comparison  of  three  item  formats  with  respect  to
reliability,  fakability,  and  acceptability  to  respondents.  Dissertation
Abstracts  International,  1972,  33(5-A),  2182-2183.

Rating  Scales,  Ranking,  Check  List,  Respondent's  2,  11 

Motivation

Dissertation  Abstracts  International,  33(5-A),  A-M

pp.  2182-2183  (Rev.) 


Steiner,  I.  D.  Scalogram  analysis  as  a tool  for  selecting  poll  questions.
Public  Opinion  Quarterly,  1955,  19,  415-424.

Question  Stem  3g 

Psych.  Abst.  , 3^,  #380  A-M 


Steinheiser,  F.  H.  Individual  preference  scales  within  a multidimensional
"similarities"  space.  Journal  of  Experimental  Psychology,  1970,  325-327.


Steinmetz,  H.  C.  Measuring  ability  to  fake  occupational  interest.  Journal
of  Applied  Psychology,  1932,  16.


Stember,  H.,  & Hyman,  H.  How  interviewer  effects  operate  through  question
form.  International  Journal  of  Opinion  and  Attitude  Research,  1949,  3, 
493-512. 

Investigator  Error,  Response  Alternative,  3a,  13,  11 

Respondent's  Motivation 

Psych.  Abst.,  2^,  #5793


Stember,  H.,  & Hyman,  H.  Interviewer  effects  in  the  classification  of
responses.  Public  Opinion  Quarterly,  1949-50,  13(4),  669-682. 


Stephan,  F.  F.,  & McCarthy,  P.  J.  Sampling  opinions:  An  analysis  of
survey  procedure.  New  York:  John  Wiley,  1963.


Stephenson,  W.  The  study  of  behavior.  Chicago:  University  of  Chicago
Press,  1953.

Textbook,  Card  sorts  17

ORA  R-M


Stern,  C.  G.  Congruence  and  dissonance  in  the  ecology  of  college  students. 
Student  Medicine,  1960,  8,  304-339.


Stevens,  S.  N.,  & Wonderlic,  E.  F.  An  effective  revision  of  the  rating
technique.  Personnel  Journal,  1934,  13,  125-134.


Stevens,  S.  S.  On  the  theory  of  scales  of  measurement.  Science,  1946,
103(2684),  677-680.

Data  Analysis,  Instrument  Format  8,  3

Abaum,  et  al.  (Eds.)  Scientific  Marketing  Research,  R-N 

New  York;  Scott  Foresman,  1972.  (Rev.  from  rept.) 


Stevens,  S.  S.  Mathematics,  measurement,  and  psychophysics.  In  Stevens,
S.  S.  (Ed.),  Handbook  of  experimental  psychology.  New  York:  John  Wiley,
1951. 


Stevens,  S.  S.  Ratio  scales,  partition  scales  and  confusion  scales.  In
Gulliksen,  H.,  & Messick,  S.  J.  (Eds.),  Psychological  scaling:  Theory  and
applications.  New  York:  Wiley,  1960.


Stevens,  S.  S.  Psychophysics  and  social  scaling.  Morristown,  N.  J.:
General  Learning,  1972.


Steward,  V.  The  problem  of  detecting  "fudging"  on  vocational  interest 
tests.  Los  Angeles,  Calif.:  Personnel  Reports  for  Sales  Executives,  1947.


Stewart,  L.  H.,  & Ronning,  R.  R.  The  use  of  anchors  in  equisection  scaling
of  interests.  California  Journal  of  Educational  Research,  1968,  19(3), 
127-131. 

Response  Alternatives,  Scaling,  Interest  Measures  3f,  14 


Stewart,  N.,  & Nelson,  B.  Methodological  investigation  of  the  forced- 
choice  technique  utilizing  the  officer  description  and  the  officer  evalua- 
tion blanks.  Study  No.  701:  PRS-SSU , EAR-NS-e  b PR  4061-04.  Adjutant 
General's  Office,  1945. 


N/A 


N/A 


N/A 


T-H 


Stezl,  I.  On  the  reliability  of  difference  scores  from  scales  with  partly
identical  items.  Zeitschrift  für  experimentelle  und  angewandte  Psychologie,
1971,  18(1),  157-165. 


Stocking,  M.  Short  tailored  tests.  Research  Bulletin  69-62  and  Office 
of  Naval  Research  Technical  Report,  Contract  N00014-69-C-0017.  Princeton,
N.J.:  Educational  Testing  Service,  1969. 


Stokes,  S.  M.,  & Lehman,  H.  C.  Influence  of  self-interest  upon
questionnaire  replies.  School  and  Society,  1930,  2^,  435-438.


Stone,  L.  A.  Magnitude  estimation  and  numerical  category  scale  evaluations 
of  category  scale  adjectival  stimuli  on  three  clinical  judgmental  continua.
Journal  of  Clinical  Psychology,  1970,  26(1),  24-27.


Stouffer,  S.  A.  An  experimental  comparison  of  statistical  and  case  history 
methods  of  attitude  research.  Unpublished  doctoral  thesis,  University  of 
Chicago,  1930. 


Stouffer,  S.  A.  Studying  the  attitudes  of  soldiers.  Proceedings  of  the 
American  Philosophical  Society,  1948,  92,  336-340.

Attitude  Measures,  Military  Personnel  14 

Psych.  Abst.,  23,  #3170  A-M


Stouffer,  S.  A.,  Borgatta,  E.  F.,  Hays,  D.  G.,  & Henry,  A.  F.  A technique
for  improving  cumulative  scales.  Public  Opinion  Quarterly,  1952,  16, 
273-291. 


* Stouffer,  S.  A.,  et  al.  Studies  in  social  psychology  in  World  War  II. 
Volume  4.  Measurement  and  prediction.  Princeton,  New  Jersey:  Princeton 
University  Press,  1949. 

Military  Personnel,  Scaling,  Questionnaire  Theory  2,  8,  14 

and  Development,  Scoring,  Textbook 

ORA  R-M 


Stover,  R.  E.  The  measurement  of  change  in  a unidimensional  attitude  by 
Guttman  scale  analysis  techniques.  Public  Opinion  Quarterly,  1958,  22,
116-122.

Military  Personnel,  Instrument  Format  3g 

Psych.  Abst.,  33,  #10143  (Rev.  from  rept.)  R-H


* Strahan,  R.  Subject  satisfaction  with  questionnaire  assessment  as  a function
of  response  format.  Proceedings  of  the  Annual  Convention  of  the  American
Psychological  Association,  1971,  6 (Pt.  1),  125-126. 


Response  Alternatives,  Rating  Scales,  True-False 
Items 


3f,  3a 


Straits,  B.  C.,  Wueben,  P.  L.,  & Theophile,  J.  M.  Influences  on  subjects'
perceptions  of  experimental  research  situations.  Sociometry,  1972,  35(6),
499-518.


Stricker,  L.  J.  Some  item  characteristics  that  evoke  acquiescent  and
social  desirability  response  sets  on  psychological  scales.  Dissertation
Abstracts,  1962,  ^(11),  4077-4078.


Attitude  Measures,  Personality  Measures, 

Response  Bias 

Dissertation  Abstracts,  ^(11),  pp.  4077-4078  (Rev.)


12,  14


* Stricker,  L.  J.  Acquiescence  and  social  desirability  response  styles,
item  characteristics,  and  conformity.  Psychological  Reports,  1963, 
319-341. 

Response  Bias,  Attitude  Measures,  Personality 

Measures,  Clarity  ^ 

Psych.  Abst.,  38,  #4225  A-H


Stricker,  L.  J.  Sequential  trends  in  response  styling.  Research
Memorandum  64-7.  Princeton,  New  Jersey:  Educational  Testing  Service,  1964.


Strong,  E.  K.,  Jr.  Procedure  for  scoring  an  interest  test.  Psychol.
Clinic,  1930,  19,  63-72.


Stuart,  A.  Basic  ideas  of  scientific  sampling.  London:  Griffin,  1962. 


Stycos,  J.  M.  Interviewer  effect  on  scale  reproducibility.  American
Sociological  Review,  1955,  ^^(4),  443-446.


Sublette,  D.  J.  The  preparation  of  pencil  and  paper  tests.  Public 
Personnel  Review,  1941,  1-17. 


Suchman,  D.  I.  Responses  of  subjects  to  two  types  of  interviews.
Dissertation  Abstracts,  1967,  27(9-B),  3297.


Interviews  2 

Dissertation  Abstracts,  27(9-B),  p.  3297.  A-M


Suchman,  E.  A.  The  intensity  component  in  attitude  and  opinion  research. 
In  Stouffer,  S.  A.,  et  al.  (Eds.),  Measurement  and  prediction.  Princeton, 
New  Jersey:  Princeton  University  Press,  1950. 


* Suchman,  E.  A.,  & Guttman,  L.  A solution  to  the  problem  of  question
"bias".  Public  Opinion  Quarterly,  1947,  445-455.

Investigator  Error  13 

Potter,  Sharpe,  Hendee  and  Clark,  1972  A-H 


Suchman,  E.  A.,  & McCandless,  B.  Who  answers  questionnaires?  Journal  of
Applied  Psychology,  1940,  24,  758-769.


Suchman,  E.  A.  Scale  analysis  and  the  intensity  component  in  attitude
and  opinion  research.  Princeton,  N.  J.:  Princeton  University  Press,
1950.


Sudman,  S.  On  the  accuracy  of  recording  of  consumer  panels-Part  II. 
Journal  of  Marketing  Research,  1964,  1(3),  69-83.


Sudman,  S.  New  approaches  to  control  of  interviewing  costs.  Journal  of 
Marketing  Research,  1966,  56-61. 


Sudman,  S.,  & Bradburn,  N.  M.  Response  effects  in  surveys:  A review  and 
synthesis.  Chicago:  Aldine  Publishing  Co.,  1974. 


Response  Bias,  Interviews  12 

ORA  A-H 


Sudman,  S.,  & Ferber,  R.  A comparison  of  alternative  procedures  for
collecting  consumer  expenditure  data  for  frequently  purchased  products.
Urbana,  Ill.:  Illinois  University,  Survey  Research  Lab.,  1973.  Faculty
Working  Paper  No.  87.


Sudman,  S.,  Greeley,  A.,  and  Pinto,  L.  The  effectiveness  of
self-administered  questionnaires.  Journal  of  Marketing  Research,  1965,  293-297.

Interviews,  Attitude  Measures  1 

ORA  R-H 


Summers,  G.  F. , & Hammonds,  A.  D.  Effect  of  racial  characteristics  of 
investigator  on  self-enumerated  responses  to  a Negro  prejudice  scale. 
Journal  of  Social  Forces,  1966,  44(4),  515-518.

Attitude  Measures,  Investigator  Error  9,  12 

Journal  of  Social  Forces,  44,  p.  515  (Rev.  from  rept.)  R-H


Sundland,  D.  M.  The  construction  of  Q sorts:  A criticism.  Psychological 
Review,  1962,  62-64.


Survey  Research  Centre.  Respondent  understanding  of  questions  in  the 
survey  interviews.  London:  Reprint  Series,  Survey  Research  Centre,  London 
School  of  Economics  and  Political  Science,  1967  (a). 


N/A 

N/A 

N/A 

T-H 


Survey  Research  Centre.  The  semantic  differential  scaling  system  in 
market  research:  I.  Order  effects.  London:  Reprint  Series,  Survey
Research  Centre,  London  School  of  Economics  and  Political  Science,  1967(b). 

N/A 

N/A  T-M 

Survey  Research  Center.  Interviewer's  manual.  Ann  Arbor,  Mich.:
University  of  Michigan,  Survey  Research  Center,  Institute  for  Social 
Research,  1969  (a). 


Survey  Research  Centre.  The  semantic  differential  scaling  system  in 
market  research:  III.  Interviewer  deviation  from  instructions.  London:
Reprint  Series,  Survey  Research  Centre,  London  School  of  Economics  and 
Political  Science,  1969  (b). 


N/A 

N/A 

N/A 

T-M 

Survey  Research  Centre.  A comparison  of  the  vertical  and  horizontal 
systems  of  presenting  differential  rating  scales.  London:  Reprint 
Series,  Survey  Research  Centre,  London  School  of  Economics  and  Political 
Science,  1970. 

Semantic  Differential  Items,  Instrument  Format,  3e 

Response  Alternatives 


* Survey  Research  Centre.  The  extent  and  the  nature  of  order  effects  in 

using  the  semantic  differential  scaling  technique.  London:  Reprint  Series, 
Survey  Research  Centre,  London  School  of  Economics  and  Political  Science, 
1972. 

Instrument  Format,  Semantic  Differential  Items,  3c,  9 

Question  Stem 

ORA 


Sutcliffe,  J.  P.,  & Bristow,  K.  A.  Do  rank  order  and  scale  properties
remain  invariant  under  changes  in  the  set  of  scaled  stimuli?  Australian
Journal  of  Psychology,  1966,  26-40.


N/A 

N/A 

N/A 

T-H 

Swineford,  F.  1971  AERA  Conference  summaries:  Innovations  in  measurement.
Princeton,  New  Jersey:  ERIC  Clearinghouse  on  Tests,  Measurement  and
Evaluation,  1972.  Report  No.  TM-R-15.


* Swordes,  A.  Effect  of  changing  the  number  of  item  responses  from  five  to 
four  in  the  same  test.  Journal  of  Applied  Psychology,  1952,  342-343.

Multiple  Choice  Items,  Response  Alternatives,  2,  3g,  13 

Clarity,  Instrument  Format,  Investigator  Error

Psych.  Abst.,  27,  #4752  A-H


* Symonds,  P.  M.  On  the  loss  of  reliability  in  ratings  due  to  coarseness 
of  the  scale.  Journal  of  Experimental  Psychology,  1924,  456-461.

Data  Analysis,  Personality  Measures,  Reliability, 

Response  Alternatives,  Scaling  3a,  8 

ORA  R-H 


* Symonds,  P.  M.  Influence  of  order  of  presentation  of  items  in  ranking. 
Journal  of  Educational  Psychology,  1936,  445-449. 

Attitude  Measures,  Instrument  Format,  Ranking  3b,  3c,  9

ORA  R-H


Symonds,  P.  M.  Research  on  the  interviewing  process.  Journal  of  Educa- 
tional Psychology,  1939,  2jJ,  346-353. 


Szaly,  L.  B.,  et  al.  Attitude  measurement  by  free  verbal  association. 
Journal  of  Social  Psychology,  1970,  ^(i),  43-55. 


Tajfel,  H.  The  anchoring  effects  of  value  in  a scale  of  judgments.
British  Journal  of  Psychology,  1959,  294-304.


Investigator  Error,  Response  Bias,  Raters, 
Response  Alternatives 

ORA 


3f,  12 
R-H 


Tajfel,  H.,  Richardson,  A.,  & Everstine,  L.  Individual  consistencies  in
categorizing:  A study  of  judgment  behavior.  Journal  of  Personality.  1964, 
32,  90-108. 


Tallent,  N.  A note  on  an  unusually  high  rate  of  returns  for  a mail  ques- 
tionnaire. Public  Opinion  Quarterly,  1959,  579-581. 


Tamir,  P.  An  alternative  approach  to  the  construction  of  multiple  choice
test  items.  Journal  of  Biological  Education,  1971,  305-307.


Tate,  M.  W.,  & Clelland,  R.  C.  Nonparametric  and  shortcut  statistics.

Danville,  111.,:  Interstate  Printers  and  Publishers,  1957. 

Data  Analysis  18 

ORA  R-N


Taylor,  E.  K.,  & Hastman,  R.  Relation  of  format  and  administration  to
the  characteristics  of  graphic  rating  scales.  Personnel  Psychology.  1956, 
9(1),  181-206. 

Ranking,  Raters,  Instrument  Format,  Scoring  3c,  8,  3g 

ORA  R-H 


Taylor, E. K., & Manson, G. E. Supervised ratings - making graphic scales
work. Personnel, 1951, 27, 504-514.


Taylor, E. K., et al. Rating scale content: II Effect of rating on
individual scales. Personnel Psychology, 1958, 11, 519-533.




Taylor, I. A. Similarities in the structure of extreme social attitudes.
Psychological Monographs, 1960, 74(2), Whole No. 489.

Attitude  Measures  in 


* Taylor, J. B., & Parker, H. A. Graphic ratings and attitude measurement:
A comparison of research tactics. Journal of Applied Psychology, 1964,
48(1), 37-42.

Attitude Measures, Rating Scales, Reliability        2

Psych.  Abst. , 2®,  #8297  A-H 


Taylor,  J.  C. , & Bowers,  D.  G.  The  survey  of  organizations:  Towards  a 
machine-scored,  standardized  questionnaire  format.  Ann  Arbor,  Mich.: 
University  of  Michigan,  Institute  for  Social  Research,  1970.  ONR  Contract 
N00014-67-A-0181-0013.


TenHouten, W. D. Scale gradient analysis: A statistical method for
constructing and evaluating Guttman Scales. Sociometry, 1969, 32(1), 80-98.


* Tenopyr,  M.  L.  Internal  consistency  of  ipsative  scores:  The  "one  reliable 
scale"  phenomenon.  Proceedings  of  the  76th  Annual  Convention  of  the  Ameri- 
can Psychological  Association.  1968,  3,  245-246. 

Forced Choice Items, Questionnaire Theory and Development,        14
Reliability


* Terris, F. Are poll questions too difficult? Public Opinion Quarterly,
1949, 13(2), 313-319.

Clarity  4 




Thomas, J. A., & Sadacca, R. Impact of feedback on accuracy of confidence
levels assigned by interpreters. U. S. Army BESRL, Technical Research Note
No. 187, 1967.


Thompson, C. A. Development of the Airman Qualifying Examination, forms
D and E. United States Air Force WADC, Technical Report No. 58-94, Pt. I,
1958.

Questionnaire Theory and Development        18

Psych. Abst., 33, #11109        A-NA


Thompson, J. W. Bi-polar and unidirectional scales. British Journal of
Psychology, 1963, 54(1), 15-24.


Thorndike,  R.  L.  Personnel  selection;  test  and  measurement  techniques. 
New  York:  Wiley,  1949. 

Questionnaire  Theory  and  Development,  Textbook  17,  14 

Psych. Abst., 23, #5074        A-M


Thorndike, E. L., & Lorge, I. The teacher's word book of 30,000 words.
New  York:  Columbia  University  Press,  1944. 

Clarity 


* Thumin, F. J. Watch for those unseen variables. Journal of Marketing,
1962, 26, 59.

Instrument Format, Question Stem, Interviews,        3c, 7, 9, 11,
Respondent's Motivation                              12, 3g

ORA        R-M




Thurstone, L. L. Equally often noticed differences. Journal of Educational
Psychology, 1927 (a), 18, 289-293.


Thurstone, L. L. A law of comparative judgment. Psychological Review,
1927 (b), 273-286.

Questionnaire Theory and Development        14

ORA        R-N


Thurstone, L. L. The method of paired comparisons for social values. Journal
of Abnormal and Social Psychology, 1927 (c), 21, 384-400.


Thurstone, L. L. Attitudes can be measured. American Journal of Sociology,
1928 (a), 33, 529-554.


Thurstone, L. L. The measurement of opinion. Journal of Abnormal and
Social Psychology, 1928 (b), 22, 415-420.

Questionnaire Theory and Development, Scaling        15

Journal of Abnormal and Social Psychology, 27, p. 430        R-N


Thurstone, L. L. The indifference function. Journal of Social Psychology,
1931 (a), 2, 139-167.

Questionnaire Theory and Development        18

ORA        R-N


Thurstone, L. L. The measurement of social attitudes. Journal of Abnormal
and Social Psychology, 1931 (b), 249-269.


Thurstone, L. L. The measurement of social attitudes. Chicago: University
of Chicago Press, 1931 (c).

Textbook        16

ORA        T-H




* Thurstone, L. L. The measurement of values. Chicago: University of
Chicago  Press,  1959. 

Scaling 

ORA 


Thurstone, L. L., & Chave, E. J. The measurement of attitudes. Chicago:
University  of  Chicago  Press,  1929. 


Tiffin, J. Merit rating: Its validity and techniques. In Dooher, M. J.,
& Marquis, V. (Eds.), Rating Employee and Supervisory Performance, A
Manual of Merit Rating Techniques. New York: American Management
Association, 1950.


Timothy,  R.  Relationships  of  self-ratings  and  external  judgments  to 
relatively  objective  measurements.  University  of  Southern  California, 
Abstracts of Dissertations...1948, 1948, 286-291.


Tireman, L. S., & Woods, V. E. Note on the influence on the validity of a
vocabulary test of the method of indicating responses. Journal of
Educational Psychology, 1940, 153-154.


Tittle, C. R., & Hill, R. J. Attitude measurement and the prediction of
behavior: An evaluation of conditions and measurement techniques. Sociometry,
1967, 199-213.


Titus,  H.  E.,  & Hollander,  E.  P.  The  California  F scale  in  psychological 
research;  1950-1955.  Psychological  Bulletin,  1957,  47-64. 


Toman, W. The Multiple Attitude Test: A diagnostic device. Journal of
Abnormal and Social Psychology, 1955, 51, 163-170.

Questionnaire  Theory  and  Development  18 

Psych.  Abst.  , 3^,  #4604  A-N 




* Toops, H. A. The factor of mechanical arrangement and typography in
questionnaires. Journal of Applied Psychology, 1937, 21(2), 225-229.

Clarity, Instrument Format, Open-Ended Items        4, 8, 9, 14

Potter, Sharpe, Hendee and Clark, 1972 (Rev. from rept.)


Toops, H. A. A comparison by work-limit and time-limit of item analysis
indices for practical test construction. Educational and Psychological
Measurement, 1960, 251-266.

Achievement  Measures  18 


Psych.  Abst . , 35,  #6395 


* Torgerson, W. S. Theory and methods of scaling. New York: John Wiley,
1958. 


Scaling,  Questionnaire  Theory  and  Development 


Torgerson, W. S. Scaling and test theory. Annual Review of Psychology,
1961,  12,  51-70. 


Towne, D. C. Displaying semantic differential data in three-dimensional
space. Paper presented at the Annual Meeting of the American Educational
Research Association, New York, New York, 1971.


Traub, R. E., & Hambleton, R. K. The effect of scoring instructions and
degree of speediness on the validity and reliability of multiple-choice
tests. Educational and Psychological Measurement, 1972, 32(3), 737-758.

Achievement Measures        18

ERIC Document Reproduction Service, EJ 064 145




* Travers, R. M. W. A critical review of the validity and rationale of the
forced-choice technique. Psychological Bulletin, 1951, 62-70.

Forced-Choice Items, Rating Scales, Raters        2

Psych. Abst., #664        A-M


* Trent,  R.  The  color  of  the  investigator  as  a variable  in  experimental 
research  with  negro  subjects.  The  Journal  of  Social  Psychology,  1954,  ^ 
(2),  281-287, 

Investigator Error        13


Tresselt, M. E., & Volkmann, J. The production of uniform opinion by
non-social stimulation. Journal of Abnormal and Social Psychology, 1942, 37,
234-243.


Triandis, H. C. Attitude and attitude change. New York: John Wiley,
1971.


Trott, D. M., & Jackson, D. N. An experimental analysis of acquiescence.
Journal  of  Experimental  Research  in  Personality,  1967,  2,  278-288. 


Trow,  M.  Comment  on  'participant  observation  and  interviewing-a  comparison.' 
Human  Organization.  1957,  j^(3),  33-35. 


* Tsudzuki,  A.  Shitsumonshi  chosaho  ni  kansuru  kenkyu  II;  muoto  no  bunseki. 
(Studies on the questionnaire method II; analysis of non-response.)
Japanese Journal of Psychology, 1953, 24, 226-238.


Tucker, L. R. Description of the choices of most preferred among triples
of stimuli by a preference space generated from choices among pairs of stimuli.
Research Memorandum 58-8, Princeton, New Jersey: Educational Testing Service,
1957.

N/A  N/A
N/A  T-M




Tucker, L. R. Scaling and test theory. Annual Review of Psychology,
1963, 14, 351-364.

Literature  Review  16 


* Tuckman, J., & Lorge, I. The effect of changed directions on the attitudes
about  old  people  and  the  older  worker.  Educational  and  Psychological 
Measurement , 1953,  J^,  57i-595. 


True-False Items, Response Alternatives, Scoring,        2, 3a, 11
Rating Scales, Respondent's Motivation


Tupes, E. C., & Christal, R. E. Stability of personality trait rating
factors obtained under diverse conditions. USAF WADC Technical Note No.
58-61, 1958.


* Turgut, M. F. A comparison between two forced-choice personality test
formats. Dissertation Abstracts, 1963, 24(5), 2118-2119.

Card Sorts, Paired Comparison Items, Personality        3g, 2, 11
Measures, Respondent's Motivation

Dissertation Abstracts, 24(5), pp. 2118-2119 (Rev.)


Turner, C. B., & Fiske, D. W. Item quality and appropriateness of response
processes. Educational and Psychological Measurement, 1968, 28, 297-315.

Clarity, Reliability        3g, 4

Educational and Psychological Measurement, p. 314


Tversky, A. Intransitivity of preferences. Psychological Review, 1969,
76,  31-48. 


Tversky, A. On the optimal number of alternatives at a choice point.
Journal of Mathematical Psychology, 1964, 1, 386-391.

Multiple Choice Items, Response Alternatives        3a, 4

ORA        R-M


Twedt, D. W. Consumer psychology. Annual Review of Psychology, 1965, 16,
265-294.

Tyler, L. E. Tests and measurements. (2nd ed.) Englewood Cliffs, N.J.:
Prentice-Hall, 1971.


Udell,  J.  Can  attitude  measurement  predict  consumer  behavior?  Journal 
of  Marketing,  1965,  H,  46-50. 

Attitude  Measures,  Scaling,  Validity  1^ 


Uhrbrock, R. S. 2000 scaled items. Personnel Psychology, 1961, 14(2),
375-420. 

Rating  Scales,  Card  Sorts 
Psych. Abst., 37, #3919


United States Government Printing Office. Health survey procedure-concept
questionnaire development and definitions in the health interview survey.
Washington, D. C.: U.S. Government Printing Office, Health Statistics,
Series 1, No. 2, 1964.

Interviews        18


* Upshaw, H. S. Own attitude as an anchor in equal-appearing intervals.
Journal of Abnormal and Social Psychology, 1962, 85-96.

Response Bias, Response Alternatives        12, 3f

ORA        R-H


Upshaw,  H.  S.  The  effect  of  variable  perspectives  on  judgments  of  opinion 
statements  for  Thurstone  scales;  Equal  appearing  intervals.  Journal  of 
Personality  and  Social  Psychology,  1965,  2,  60-69. 

Data  Analysis,  Scaling,  Response  Bias  12,  1h 

Journal of Personality and Social Psychology, 2, p. 60        R-N


Upshaw, H. S., Ostrom, T. M., & Ward, C. D. Content versus self-rating
in attitude research. Journal of Experimental Social Psychology, 1970, 6(3),
272-279.


Uriel, G. F. Scale and intensity analysis in opinion research. International
Journal of Opinion and Attitude Research, 1950, 192-208.


Urry, V. W., & Nicewander, W. A. Factor analysis of the commander's
evaluation report. U.S. Army Enlisted Evaluation Center Technical Report No.
1966. 


* U.S.  Army  Test  and  Evaluation  Command.  Development  of  a guide  and  checklist 
for  Human  Factors  Evaluation  of  Army  equipment  and  systems.  U.  S.  Army  Test 
and Evaluation Command (TECOM), 1973. Contract DAAD05-73-C-038S.

Adjectives  ° 




U. S. Dept. Army, AGO, PRB. A study of officer rating methodology: IV.
Effect of forced choice items on validity of rating scales. Personnel
Research Branch Report No. 903, 1952 (a).

Forced Choice Items, Military Personnel,
Raters, Rating Scales

Psych. Abst., 28, #8182


U. S. Dept. Army, AGO, PRB. A study of officer rating methodology: VIII.
Validity of two types of rating techniques: Forced-choice items and rating
scales. Personnel Res. Br. Rep., No. 907. Washington: American
Documentation Institute, 1952 (b).

Check List, Forced Choice Items, Military Personnel,        1, 2
Raters, Rating Scales

Psych. Abst., 28, #8186


Van Der Veen, F., Howard, K. I., & Austria, A. M. Stability and equivalence
scores based on three different response formats. Proceedings of
the 78th Annual Convention of the American Psychological Association, 1970,
99-100 (Summary).

Card Sorts, Multiple Choice Items, True-False Items,        2, 8, 12
Response Alternatives, Reliability, Response Bias


van Naerssen, R. F. Two-choice items in study-tests. Nederlands Tijdschrift
voor  de  Psychologie  en  haar  Grensgebieden , 1970,  2^(6),  393-403. 


Vaughn,  C.  L.  A scale  for  assessing  socio-economic  status  in  survey 
research.  Public  Opinion  Quarterly,  1958,  22,  19-34. 


Vaughn, G. M. Eliminating a non-scale type from a social distance scale.
Psychological Reports, 1962, 11(3), 912.

Attitude  Measures,  Clarity  3g 




Veldman, D. J., & Parker, G. V. Adjective rating scales for self description.

Adjective Rating Scales        18

Multivariate Behavioral Research, 5, p. 295        A-NA


Vernon, P. E. Questionnaires, attitude tests, and rating scales. In
F. S. Bartlett, et al. (Eds.) The study of society. London: Kegan Paul,
Trench, Trubner, 1939 and 1946.

Questionnaire Theory and Development        14

ORA        R-N


Vernon, P. E. Personality tests and assessment. London: Methuen, 1962.


Vernon, P. E. Effects of administration and scoring on divergent thinking
tests. British Journal of Educational Psychology, 1971, 41(3), 245-257.


Vicary, J. M. The circular test of bias in personal interview surveys.
Public Opinion Quarterly, 1955, 19, 215-218.


Vidich, A. J., & Shapiro, G. A comparison of participant observation and
survey data. American Sociological Review, 1955, 20, 28-33.


Vinacke, W. E. Two tests to measure exploitative and accommodative
strategy. Buffalo, N.Y.: State University of New York, 1964, Technical
Report No. 7.

Respondent's Motivation        18

ORA        A-NA


Vincent, N. L. The development of a new attitude measurement technique.
Dissertation Abstracts, 1961, 1997.




Voas, R. B. A procedure for reducing the effects of slanting questionnaire
responses toward social acceptability. Educational and Psychological
Measurement, 1958 (a), 18, 337-345.

Voas, R. B. Relationships among three types of response sets. USN School
of Aviation Medical Projects in Research. Subtask 1, Prj. No. NMO,
1958 (b). Report No. 15.


Voas, R. B., Blari, J. T., & Ambler, R. K. Validity of personality
inventories in the Naval aviation selection program. USN School of Aviation,
Subtask 1, Prj. No. NM 16 01 11, 1957. Report No. 13.

Military Personnel, Personality Measures

Psych. Abst., 33, #853


Volkmann, J. The method of single stimuli. American Journal of
Psychology, 1932, 808-809.


Volkmann, J. The relation of the time of judgment to the certainty of
judgment.  Psychological  Bulletin,  1934,  672-673. 


* Volkmann,  J.  The  anchoring  of  absolute  scales.  Psychological  Bulletin, 
1936,  33,  742-743. 

Instrument  Format,  Investigator  Error  3f 


Volkmann, J. The natural number of categories in absolute judgment.
Psychological  Bulletin,  1937,  34,  543-544. 

Response  Alternatives  3a 


Volkmann,  J.  The  compression  of  an  absolute  scale.  Psychological  Bulletin, 
1938, 35, 676.




Volkmann, J. Lectures in the psychology of judgment. Columbia University,
1942. (Unpublished.)

N/A  N/A
N/A  T-M


Volkmann, J. Scales of judgment and their implications for social
psychology. In Rohrer, J. H., & Sherif, M. Social psychology at the crossroads;
the University of Oklahoma Lectures in social psychology. New York:
Harper, 1951. Pp. 273-298.

Rating Scales, Textbook        15


Psvch.  Abst.  , 26,  i/83b  (Rev.  from  rept.) 


Volkmann, J., Hunt, W. A., & McGourty, M. Variability of judgment as a
function of stimulus-density. American Journal of Psychology, 1940,
277-284.


Votaw, D. F., & Danforth, L. The effect of method of response upon the
validity of multiple-choice tests. Journal of Educational Psychology,
1939, 624-627.


Wagner, I. F. Articulate and inarticulate replies to questionnaires.
Journal of Applied Psychology, 1939, 23(1), 104-115.


Wagner, R. F. A group situation compared with individual interviews for
securing personnel information. Personnel Psychology, 1948, 1, 93-107.


Wagner, R. K. An investigation of government employee perceptions of their
organizational climate. Wright-Patterson AFB, Ohio: Air Force Institute
of Technology, 1971. Report No. GSM SM 71-13.

Interviews, Military Personnel, Preference Measures        18

ORA        A-NA


Wahler, H. J. Response styles in clinical and nonclinical groups.
Journal of Consulting Psychology, 1961, 25, 533-539.


Wakefield, J. A. A reply to Dr. Adkins. Public Personnel Review, 1958(b),
19, 298-300.


Wakefield, J. A. Does the fifth choice strengthen a test item? Public
Opinion Quarterly, 1958(a), 19, 45-48.

Achievement Measures        18

Psych. Abst., 33, #7029        A-N


Waksberg,  J.  Conditioning  effects  from  repeated  household  interviews. 
Journal of Marketing, 1964, 28(2), 51-56.


Walker,  D.  A.  Answer-pattern  and  score-scatter  in  tests  and  examinations. 
British  Journal  of  Psychology,  1940,  30,  248-260. 


Walker,  H.  W.  Certain  mathematical  questions  suggested  by  true-false  test. 
Amer. Math. Mo., 1927, 34, 504-515.


Walker,  K.  P.  Examining  personnel  information  items  of  a questionnaire 
study.  Journal  of  Educational  Research.  1937,  2i>  281-282. 


Wallace, D. Mail questionnaires can produce good samples of homogeneous
groups.  Journal  of  Marketing,  1947,  XII(l) , 53-60. 


Walsh,  J.  A.,  Jr.  A factorial  study  of  a large  sample  of  response  set 
and  attitude  scales.  Dissertation  Abstracts.  1964,  ^(11),  4824-4825. 

Walsh,  J.  A.  Prediction  of  anchor  effects  on  personality  items  from  rating 
dispersions. Educational and Psychological Measurement, 1968, 28(2), 317-325.

Personality  Measures,  Response  Alternatives  3f,  14 


* Walsh, W. B. Validity of self-report. Journal of Counseling Psychology,
1967, 14(1), 18-23.

Interviews, Closed-Ended Items, Respondent's
Motivation


Walsh, W. B. Validity of self-report; Another look. Journal of
Counseling Psychology, 1968, 15(2), 180-186.


Walsh, W. B. Self-report under socially undesirable and distortion
conditions. Journal of Counseling Psychology, 1969, 16(6), 569-574.


Walters, J. H. Structured or unstructured techniques? Journal of
Marketing, 1961, 25, 58-62.

Open-Ended  Items 


Wang,  C.  K.  A.  Suggested  criteria  for  writing  attitude  statements. 
Journal  of  Social  Psychology,  1932,  1(3),  367-373. 


Ward, C. D. Ego-involvement and the absolute judgment of attitude
statements. Paper to American Psychological Association, Annual Meetings,
Philadelphia, 1963. (mimeographed)


Ward, C. D. Multiple-choice question writing; Research participation and
exam performance. Journal of College Science Teaching, 1973, 3(1), 77-78.




Ward, C. D. Issue saliency and the correspondence between measures of
attitude. College Park, Maryland: Maryland University, Dept. of
Psychology, 1969. Report No. TR-15.

Scaling, Semantic Differential Items,        2, 10
Attitude Measures

DDC, # AD 695 000        A-H


Warner,  S.  L.  Randomized  response;  A survey  technique  for  eliminating 
evasive  answer  bias.  Journal  of  the  American  Statistical  Association. 
1965, 60(309), 63-69.


Warren, G. S. Item content and format as evidence for response biases.
Dissertation Abstracts International, 1972, 33(6-A), 2778.

Investigator Error, Response Bias        10, 12

ORA        R-H


Warren,  R.  D. , & others.  Moderator  effects  on  attitude  scale  construction. 
Home Economics Research Journal, 1973, 1(4), 259-268.


Waters, C., & Waters, L. K. Effect on number of alternatives and scoring
instructions on examinees' reactions to multiple-choice tests.
Psychological Reports, 1971(a), 1229-1330.


Waters, C., & Waters, L. K. Validity and likability ratings for three
scoring instructions for a multiple-choice vocabulary test. Educational
and Psychological Measurement, 1971 (b), 31(4), 935-938.


Waters, L. K. Effects of instructions and item tone to forced-choice
pairs. Personnel Psychology, 1966, 19(1), 45-53.

Rating Scales, Forced-Choice Items,        2, 7, 11, 3g
Respondent's Motivation


Waters, L. K., & Wherry, R. J., Jr. Evaluation of two forced-choice
response formats. Personnel Psychology, 1961(a), 285-289.

Forced-Choice Items, Rating Scales, Instrument        2, 11, 3g
Format, Respondent's Motivation

Psych. Abst., #5734        A-H


Waters,  L.  K. , & Wherry,  R.  J.  A note  on  the  stability  of  the  preference 
index  in  forced-choice  blocks.  USN  School  of  Aviation  Medical  Research 
Report  No.  3,  1961(b). 

Check  List,  Forced-Choice  Items  2 

Psych.  Abst.  , #4285  A-M 


Waters, L. K., & Wherry, R. J., Jr. A note on alternative methods of
scoring a forced-choice form. Personnel Psychology, 1962(a), 15(3), 315-317.

Scoring, Validity, Forced-Choice Items        8

Psych.  Abst.  , #7265  A-M 


Waters, L. K., & Wherry, R. J., Jr. The preference index and responses
to forced-choice pairs. Personnel Psychology, 1962(b), 15(2), 99-102.


Watson,  J.  J.  Improving  the  response  rate  in  mail  research.  Journal  of 
Advertising  Research,  1965,  ^(2) , 48-50. 

Respondent's  Motivation  11 

ORA 


Weaver,  H.  G.  Consumer  questionnaire  technique.  American  Marketing 
Journal  , 1934,  _^(3),  115-118. 


Webb, E. J., Campbell, D. T., Schwartz, R. D., and Sechrest, L.
Unobtrusive measures: Nonreactive research in the social sciences. Chicago:
Rand McNally, 1966.




Webb,  J.  T.  Subject  speech  rates  as  a function  of  interviewer  behavior. 
Language and Speech, 1969, 12(1), 54-67.


Webb,  S.  C.  Scaling  of  attitudes  by  the  method  of  equal-appearing  inter- 
vals: A review.  Journal  of  Social  Psychology,  1955,  215-239. 


Webb, S. C., & Chueh, J. C. The effect of role taking on the judgment
of  attitude  items.  Paper  to  American  Psychological  Association,  Annual 
Meetings,  St.  Louis,  1962.  (mimeographed) 


Webb,  W.  B.  Self-evaluation  compared  with  group  evaluations.  Journal  of 
Consulting Psychology, 1952, 16, 305-307.

Rating  Scales,  Raters  12 

Psych.  Abst.  , 11_,  #4236  A-N 


Webb,  W.  B.  A procedure  for  obtaining  self-ratings  and  group  ratings. 
Journal  of  Consulting  Psychology,  1956,  233-236. 


Webster,  M.  Correcting  personality  scales  for  response  sets  or  suppression 
effects.  Psychological  Bulletin.  1958,  62-64. 

* Webster,  M.  The  meaning  of  response  set  in  personality  inventories. 
American Psychologist, 1960, 15(7), 431. (Abstract.)

Personality Measures, Response Bias        12

American Psychologist, 15, p. 431        A-M


Webster,  H.  Acquiescence,  social  desirability  and  inhibition  reflected 
by "response set" scales. Psychological Reports, 1962, 10, 789-790.




* Wedell, C., & Smith, K. U. Constancy of interview methods in appraisal
of attitudes. Journal of Applied Psychology, 1951, 35, 392-396.

Interviews, Investigator Error, Rating Scales        1, 2, 7,
                                                     13, 12

Psych. Abst., 26, #6242 (Rev. from rept.)        R-H


Weick,  K.  E.  Systematic  observational  methods.  In  Lindzey,  G.  , & 
Aronson,  E,  (Eds.),  The  handbook  of  social  psychology.  Volume  2.  (2nd 
edition.) Reading, Mass.: Addison-Wesley, 1968. Chapter 13.


Weidemann,  C.  C.  The  omission  as  a specific  determiner  in  the  true-false 
examination. Journal of Educational Psychology, 1931, 22, 435-439.


Weidemann,  C.  C.,  & Newens,  L.  F.  Does  the  compare  and  contrast  essay 
test  measure  the  same  mental  functions  as  the  true-false  test?  Journal  of 
General Psychology, 1933 (a), 9, 430-449.

Weidemann,  C.  C.,  & Newens,  L.  F.  The  effect  of  directions  preceding  true- 
false  and  indeterminate  statement  examinations  upon  distribution  of  test 
scores.  Journal  of  Educational  Psychology,  1933  (b)  , 24,  79-106. 


Weiner, M., & Tobias, S. Chance factors in the interpretation of group
administered multiple-choice tests. Personnel and Guidance, 1963, 41(5),
435-437.

Data Analysis, Scoring, Investigator Error        8

Psych. Abst., 39, #1771        A-N


Weinland, J. D. Better words on rating scales. Personnel Journal, 1946,
25, 131-134.

Response Alternatives, Rating Scales        3f

ORA        R-N


Weinstein, A. G. Predicting behavior from attitudes. Public Opinion
Quarterly, 1972, 36(3), 355-360.




Weiss, D. J. Averaging: An empirical validity criterion for magnitude
estimation. Perception & Psychophysics, 1972, 12(5), 385-388.


Weiss,  D.  J.,  Dawis,  R.  W.  An  objective  validation  of  factual  inter- 
view data.  Journal  of  Applied  Psychology,  1960,  381-385. 


Weiss, D. J., Dawis, R. W., England, G. W., & Lofquist, L. H. Validity of
work histories obtained by interview. Minnesota Studies in Vocational
Rehabilitation, 1961, 1-3.


Weiss, W. The effects on opinions of a change in scale judgments. Journal
of Abnormal and Social Psychology, 1959, 329-334.

* Weiss,  W.  Effects  of  an  extreme  anchor  on  scale  judgments  and  attitude. 
Psychological  Reports,  1961,  377-382. 

Response  Alternatives,  Rating  Scales  3f 

Psychological  Reports,  36  , #2GD  FEW  A-H 


Weiss,  W.  Effects  of  response  scales  on  the  expected  distribution  of 
judgments  of  social  stimuli.  Psychological  Reports.  1963(a),  13(4), 
411-414. 

Scaling,  Response  Alternatives  2,  ib 

Psych.  Abst . , 38 , #7079  A-M 

* Weiss, W. Effects of unbalanced response scales on judgments of social
stimuli. Psychological Reports, 1963 (b), 12(2), 403-414.

Scaling, Response Alternatives        2, 3f, 3a, 14

Psych. Abst., 38, #4156        A-H


Weiss, W. Scale judgments of triplets of opinion statements. Journal
of Abnormal and Social Psychology, 1963 (c), 471-479.

Investigator Error, Response Alternatives        3g, 3a

Journal of Abnormal and Social Psychology, p. 471        R-NA


Weitman, M. Forms of failure to respond and varieties of authoritarianism.
Journal of Personality, 1964, 32(1), 109-118.

Attitude Measures, Open-Ended Items,        10
Respondent's Motivation

Psych. Abst., 39, #5164        A-M


Weitz,  J.  Verbal  and  pictorial  questionnaires  in  market  research.  Journal 
of  Applied  Psychology,  1950,  363-366. 

Clarity,  Preference  Measures  2,  3g,  4 

Psych.  Abst.,  26 , #588  (Rev.  from  rept.)  R-H 


Weitz, J., & Nuckols, R. C. The validity of direct and indirect questions
in  measuring  job  satisfaction.  Personnel  Psychology.  1953,  487-494. 

Question  Stem  _ , 3g 

Psych.  Abst.  , 28,  #8223  A-M 


Weitz,  S.  Attitude,  voice,  and  behavior;  A repressed  affect  model  of  inter- 
racial interaction.  Journal  of  Personality  and  Social  Psychology,  1972, 
24(1),  14-21. 


Weitzman,  R.  A. 


* Wells, W. D. The influence of yea saying response style. Journal of
Advertising Research, 1961, 1, 1-12.

Response Bias, Literature Review        12

ORA        R-H


Wells, W. D. How chronic overclaimers distort survey findings. Journal
of Advertising Research, 1963, 2(2), 8-18.

Personality Measures, Response Bias, Literature        12, 16, 9
Review

ORA        R-H


Wells, W. D., & Dames, J. Hidden errors in survey data. Journal of
Marketing, 1962, 26, 50-54.


* Wembridge,  E.  R. , & Means,  E.  R.  Obscurities  in  voting  upon  measures  due 
to  double-negative.  Journal  of  Applied  Psychology,  1918,  2,  15.---163. 

Question  Stem 

Bauman,  Rogers,  and  Weiss,  1971 


Weschler, I. R., & Bernberg, R. E. Indirect methods of attitude
measurement. International Journal of Opinion and Attitude Research, 1950,
209-228.


Wesman, A. G. Some effects of speed in test use. Educational and
Psychological Measurement, 1960, 20, 267-274.




Wesman, A. G. Active versus blank responses to multiple-choice items.
Journal of Educational Psychology, 1947, 89-95.

Check List, Multiple Choice Items        3g, 4, 3a


Westfall,  R.  L.,  Boyd,  H.  W.,  & Campbell,  D.  T.  The  use  of  structured 
techniques  in  motivation  research.  Journal  of  Marketing,  1957,  134-139. 

Forced-Choice  Items,  Projective  Items  2 


Wever, E. G., & Zener, K. E. The method of absolute judgments in
psychophysics. Psychological Review, 1923, 466-493.


Wheatley,  B.  C.,  & Cash,  W.  B.  Employee  survey:  correcting  its  basic 
weakness  (questionnaire  technique).  Personnel  Journal.  1973,  456-459. 


* Wheatley, J. J. Self-administered written questionnaires or telephone
interviews? Journal of Marketing Research, 1973, 9, 94-95.

Interviews, Semantic Differential Items        1, 2, 3g


Wheeler, R. W. A study of the relationship between selected interviewer
variables and the interpretation of interview information. Dissertation
Abstracts International, 1969, 30(1-B), 425.


Wherry, R. J. Orders for the presentation of pairs in the method of paired
comparisons.  Journal  of  Experimental  Psychology,  1938,  23,  651-660. 


Wherry, R. J. Control of bias in rating: Factor analysis of rating item
indices. AGO, Personnel Research Section, 1951. Report No. 915.


Wherry, R. J., Jr. A test of new rationale and methodology for the
forced-choice technique (AD 237 782). Pensacola, Florida: Naval School of
Aviation Medicine, 1960. Project MR005 13 5001.

N/A  N/A
N/A  R-NA


Wherry, R. J., & Fryer, D. H. Buddy ratings: Popularity contest or
leadership criteria? In Moreno, J. L. (Ed.), The sociometry reader. Glencoe,
Illinois: The Free Press, 1960.


Whipple,  J.  W.  A study  of  the  extent  to  which  positive  or  negative  phras- 
ing affects  answers  in  a true-false  test.  Journal  of  Educational  Research, 
1957,  JT,  59-63. 

Question  Stem,  Investigator  Error  13 
Psych.  Abst.  , 32,  #2001  A-N 


Whitlock, G. H. Validation of morale and attitude scales (AD 242 359).
Knoxville,  Tennessee:  University  of  Tennessee,  1960.  Technical  Report  No. 
6C-76. 


Whitran, J. R., & Schwartz, A. N. The relationship between two measures
of the tendency to give socially desirable responses. Journal of
Projective Techniques, 1967, ^(5), 72-75.


Whitney, D. R., & Feldt, L. S. Analyzing questionnaire results: Multiple
tests of hypotheses and multivariate hypotheses. Educational and
Psychological Measurement, 1973, 33(2), 365-380.


Whyte, W. On asking indirect questions. Human Organization, 1957, 15(4),
21-23.


Wick, J. W. Similar response analysis. Educational and Psychological
Measurement, 1970, 30(1), 95-110.


Wicker, M. P. A comparison of attitude scale values yielded by scales of
differing lengths. University of North Carolina...Research in Progress.
University of North Carolina Rec., 1951, ^(492), 252. (Abstract of Master's
thesis.)

* Wickes,  T.  A.,  Jr.  Examiner  influence  in  a testing  situation.  Journal  of 
Consulting  Psychology,  1956,  23-26. 

Investigator  Error,  Respondent's  Motivation  10,  11 

Psych.  Abst.  , #3090  A-H 


Wiener, G. The effect of distrust on some aspects of intelligence test
behavior. Journal of Consulting Psychology, 1957, 127-130.


Wiener, M., Carpenter, J. T., & Carpenter, B. Some determinants of
conformity behavior. Journal of Social Psychology, 1957, 289-297.


Wiggins, J. S. Interrelations among MMPI measures of dissimulation under
standard and social desirability instructions. Journal of Consulting
Psychology, 1959, 319-427.


Wiggins,  J.  S.  Convergences  among  stylistic  response  measures  from  objec- 
tive personality  tests.  Educational  and  Psychological  Measurement,  1964, 
551-562. 


Wiggins, N. Individual viewpoints of social desirability. Psychological
Bulletin, 1966, 68-77.




Wight, A. H. A study of rater and ratee characteristics. Dissertation
Abstracts International, 1969, ^(5-B), 2451.


Personality  Measures 


Wilbourn,  J.  M.,  & Quinn,  N.  Feasibility  of  using  special  measures  in  the 
classification  and  assignment  of  lower  mental  ability  airmen.  Brooks  AFB, 
Texas:  Air  Force  Human  Resources  Laboratory,  1973. 

Achievement Measures  18


Wilbur, P. H. Positional response set among high school students on
multiple-choice tests. Journal of Educational Measurement, 1970, 7, 161-164.


Wilcove, G. L. Enlisted men's and officer's opinions of recent policy
changes implemented through Z-grams. Washington, D. C.: Naval Personnel
Research and Development Laboratory, 1971. Report No. WSR-72-5.


Military  Personnel 


Wilde,  G.  J.  S.,  & DeWitt,  O.E.  Self-report  and  error  choice:  Inter- 
individual differences  in  the  operation  of  the  error-choice  principle 
and  their  validity  in  personality  questionnaire  tests.  British  Journal 
of  Psychology,  1970,  ^(2),  219-228. 


Wiley, C. F. The three-decision multiple-choice test: A method of
increasing the sensitivity of the multiple choice item. Psychological Reports,
1960, 2, 475-477.


Wiley, L., Harber, H. B., & Giorgia, M. J. Evidence for a generalized
rating tendency. Engineering and Industrial Psychology, 1959(a), 1, 55-61.

Raters,  Investigator  Error  13 

Psych.  Abst . , 35 , #5394  (Rev.)  A-N 


Wiley, L., Harber, H. B., & Giorgia, J. M. Rater tendencies in estimating
qualifications required by Air Force tasks. USAF WADC Personnel Laboratory,
1959(b). Technical Note No. 59-195.


N/A 

N/A 


N/A 

T-H 


Wiley, L., & Jenkins, W. S. Method for measuring bias in raters who
estimate job qualifications. Journal of Industrial Psychology, 1963, ^(1),
16-22.

Raters,  Investigator  Error  12 

Psych. Abst., 38, #6735  A-N


Wiley, L. N., & Trimble, O. C. The ordinary objective test as a possible
criterion of certain personality traits. School and Society, 1936, 43,
446-448.


Wilkie,  W.  L.,  & Pessemier,  E.  A.  Issues  in  marketing's  use  of  multi- 
attribute attitude  models.  Journal  of  Marketing  Research,  1973,  428. 


Wilkins,  L.  A.  Suggestions  as  to  the  formulation  of  questions  in  standard- 
ized examinations  in  modern  languages.  New  York  Bulletin  High  Points.  1923. 
27-36. 


* Wilkins, L. T. Incentives and the young male worker in England; with some
notes on ranking methodology. International Journal of Opinion and Attitude
Research, 1950, 4, 541-562.

Paired  Comparison  Items,  Ranking,  Preference 

Measures,  Military  Personnel  2,  12 

Psych. Abst., 26, #570 (Rev. from rept.)  R-H




Will, R. T., & Hasty, R. W. Attitude [...] multiple stimuli. Journal of
Marketing Research, 1971, 35,


Preference  Measures,  Rating  Scales 
ORA 


15 

R-NA 


Willcock, H. D. Mass observation. American Journal of Sociology, 1943,
48, 445-456.


Willey, C. F. The three-decision multiple-choice test: A method of
increasing the sensitivity of the multiple-choice item. Psychological Reports,
1960, 7, 475-477.


Williams, J. A., Jr. Interviewer-respondent interaction: A study of bias
in the information interview. Sociometry, 1964, 27(3), 338-352.


Williams, J. A., Jr. [...] in the information interview.
Stages of social research: Contemporary perspectives. Englewood Cliffs,
New Jersey: Prentice-Hall, Inc., 1970.


* Williams, W. H. The systematic bias effects of incomplete responses.
American Statistical Association. Proceedings of the Social Statistics
Section, Pittsburgh, Penn., 1968, 11th annual edition, 1968, 308-312.


Data  Analysis 


1 


ORA 


R-N 


Williams, W. S. A study of the use of the semantic differential by fifth
grade children from different socioeconomic groups. Journal of Psychology,
1972, 81(3), 343-50.


Willingham, W. W., & Jones, M. B. On the identification of halo through
analysis of variance. Educational and Psychological Measurement, 1958,
18, 403-407.


Willis, R. H. Manipulation of item marginal frequencies by means of
multi-response items. Psychological Review, 1960, 32-50.

Attitude  Measures,  Response  Alternatives, 

Multiple-Choice  Items,  Scoring,  Rating  Scale  8,  2 

Psych.  Abst.  , 35 , #739  (Rev.  from  rept.)  R-H 


Willis,  R.  H.  The  phenomenology  of  shifting  agreement  and  disagreement 
in  dyads.  Journal  of  Personality,  1965,  22.,  188-199. 


Winters, S., & Bartlett, C. J. Instructional and response style factors
with forced choice response. Research Reports, Sinai Hospital of
Baltimore, 1966, 2, 10-16.

Response Bias, Forced-Choice Items, Investigator Error  12, 7

Psych.  Abst..  #12852  A-H 


Winthrop, H. Reliability of preference ratings as a function of cardinal
value  and  natural  order.  Psychological  Reports.  1958,  4,  62. 

Scaling,  Reliability,  Instrument  Format  3b 

Psychological  Reports.  4,  p.  62  (rev.  from  rept.)  R-H 


Wiseman,  F.  Methodological  bias  in  public  opinion  surveys.  Public 
Opinion  Quarterly,  1972,  36(1),  105-108. 

Interviews,  Response  Bias,  Attitude  Measures, 

Respondent's Motivation  12, 11

ORA  R-H 


Witryol, S. L., & Fischer, W. F. Scaling children's incentives by the
method of paired comparisons. Psychological Reports, 1960, 7, 471-474.


* Witryol, S. L. Scaling procedures based on the method of paired-comparisons.
Journal of Applied Psychology, 1954, 38, 31-37.

Scaling  14 

Psych. Abst., 29, #125  A-M


* Witryol, S. L., & Thompson, G. C. An experimental comparison of the
stability of social acceptability scores obtained with the partial-rank-order
and the paired-comparison scales. Journal of Educational Psychology, 1953, 44,
20-30.

Response  Alternatives,  Paired  Comparison  Items,  Reliability  2,  8 


Psych.  Abst.  , 28 , #735  (Rev.  from  rept.) 


Wofford, J. C., & Willoughby, T. L. The effects of test construction
variables upon test reliability and validity. California Journal of Educational
Research, 1969, ^(3), 96-106.


* Wolfe,  D.  F.  A new  questionnaire  design.  Journal  of  Marketing,  1956, 
20,  186-190. 


Instrument  Format 


15,  17 


Wollack,  S.,  Witjing,  J.  P.,  Goodale,  J.  G.,  & Smith,  P.  C.  Weighting 
agreement  responses  by  item  scale  values.  Journal  of  Applied  Psychology, 
1970,  54(2),  174-175. 

Rating  Scales,  Attitude  Measures  2 


Womer, F. B. The valuation of item selection techniques appropriate to
a new response method for multiple-choice type test items. Dissertation
Abstracts, 1957, 17, 98.




Wonderlic, E. F. We survey attitudes annually by mail. Personnel Journal,
1953, 22, 91-93.


Wood, D. A. Test construction, development, and interpretation of
achievement tests. Columbus, Ohio: Charles E. Merrill, 1961.


Woodrow, D. Multichoice test marking scheme--Eval. Australian Science
Teachers Journal, 1973, J^(2), 57-62.

Achievement  Measures,  Scoring  18 

ERIC  Document  Reproduction  Service,  EJ  083  425  A-N 


Woodson, E. J. Human factors engineering. New York: McGraw-Hill, 1964.


Woodward,  J.  L.  Making  government  opinion  research  bear  upon  operation. 
American  Sociological  Review,  1944,  9,  610-617 . 


Worthy,  M.  Note  on  scoring  midpoint  responses  in  extreme  response-style 
scores.  Psychological  Reports,  1969,  ^(1),  189-190. 

Response  Bias,  Data  Analysis,  Rating  Scales  8,  12 

ORA  R-M 


Wright,  C.  E.  A factor  dimension  comparison  of  normative  and  ipsative 
measurements.  Educational  and  Psychological  Measurement,  1961,  433-444 

Data  Analysis,  Scoring,  Personality  Measures  8 

Psych.  Abst. , 36 , #2HE33W  A-M 


Wright, O. R., Jr. Summary of research on the selection interview since
1940. Personnel Psychology, 1969, 22(4), 391-413.


Wrightsman, L. Characteristics of positively scored and negatively scored
items from attitude scales. Psychological Reports, 1965, 17, 898.


Wrigley, C., & Neuhaus, J. O. The matching of two sets of factors.
Contract Memorandum Report, A-32. Urbana, Ill.: University of Illinois,
1955.


Wyatt,  D.  F.,  & Campbell,  D.  T.  A study  of  interviewer  bias  as  related 
to  interviewers'  expectations  and  own  opinions.  International  Journal  of 
Opinion  and  Attitude  Research,  1950,  4,  77-83. 


Wylie,  A.  T.  To  what  extent  may  we  rely  upon  the  answers  to  a school 
questionnaire?  Journal  of  Educational  Method,  1927,  252-257. 


Yadoff,  B.  An  attempt  to  change  word  meaning  and  a personality  test  score 
through  semantic  generalization.  (Doctoral  thesis.  University  of  Pittsburgh) 
Pittsburgh,  Pa.  (DA  19:2157) 


Yates, F. Sampling methods for censuses and surveys. (3rd ed.) New York:
Hafner,  1960. 


Yoell,  W.  A.  How  the  depth  interview  reveals  attitudes  toward  new  products. 
Printers' Ink, 1947, 218(6), 50-52.


Irrelevant 


18 


ORA 


R-NA


York, C. M. Stability of Thurstone scale values after 35 years.
Perceptual and Motor Skills, 1966, ^(2), 628.

Scaling,  Rating  Scales  2,  8 


ORA


R-M 


Young, P. V. Schedule and the questionnaire as aids in field exploration.
In Scientific social surveys and research (P.V.Y.), Chapter 7. New York:
Prentice Hall, 1939. Pp. 138-171.

Young,  P.  V.  The  validity  of  schedules  and  questionnaires.  Journal  of 
Educational  Sociology,  1940,  1^,  22-26. 


Young, P. V. The questionnaire and other reporting forms as aids in field
exploration. In Capt, K. G. (Ed.), Scientific social surveys and research.
(3rd ed.) Englewood Cliffs, New Jersey: Prentice Hall, 1956. Chapter VIII,
pp. 176-204.

Questionnaire  Theory  and  Development  17 


Young, P. V. Scientific social surveys and research: An introduction to
the background, content, methods, principles, and analysis of social studies.
(2nd ed.) Englewood Cliffs, N.J.: Prentice-Hall, 1966.

Questionnaire  Theory  and  Development,  Textbook  17 


Zaccaria, M. A., Tupes, E. C., & Lawrence, H. G. Development and
characteristics of the USAF officer activity inventory. United States Air Force
Personnel Training, Research Center, Research Report No. 57-15, 1957.

14,  IP 


Military  Personnel,  Interest  Measures,  Reliability 
Psych.  Abst.  , 33,  #9202 


A-NA 


* Zajonc, R. B., & Nienwenhyse, B. Relationship between word frequency and
recognition-perceptual process or response bias? Journal of Experimental
Psychology, 1964, ^(3), 276-285.


Adjectives,  Response  Bias,  Questionnaire 
Theory  and  Development 


12,  6 


* Zavala,  A.  Development  of  the  forced-choice  rating  scale  technique. 
Psychological  Bulletin,  1965,  ^(2),  117-124. 


Forced-Choice  Items,  Rating  Scale,  Response 
Bias,  Respondent's  Motivation 

Psych. Abst., 39, #6188


2,  12,  11 


* Zavalloni, M., & Cook, S. W. Influence of judge's attitudes on ratings of
favorableness of statements about a social group. Journal of Personality
and Social Psychology, 1965, 1(1), 43-54.


10,  13 


Attitude  Measures,  Raters 

Journal  of  Personality  and  Social  Psychology, 
1,  p.  43  (Rev.  from  rept.) 


* Ziller, R. C., & Long, B. H. Some correlates of the don't know response in
opinion questionnaires. Journal of Social Psychology, 1965, 67, 139-147.


8,  12 


Attitude  Measures,  Response  Bias,  Response 
Alternatives 

Journal of Social Psychology, 67, pp. 139-147
(Rev. from rept.)


* Zimbardo, P. G. Verbal ambiguity and judgmental distortion. Psychological
Reports, 1960, 6, 57-58.

Raters  10 


Zimmer, C. E. Chance distribution of inconsistent response patterns in
paired comparison and multiple ranking designs. USAF PRL Technical
Documentation Report No. 63-1, 1963.


Paired  Comparison  Items,  Ranking 
Psych. Abst., 37, #7396


* Zinnes, J. L. Scaling. Annual Review of Psychology, 1969, 20, 447-479.
Literature Review, Scaling, Data Analysis

R-M 

ORA 

Zubin, J. The chance element in matching tests. Journal of Educational
Psychology, 1933, 24, 674-681.


Zubin, J. Nomograph. Journal of the American Statistical Association, 1939,
34, 539-544.

Data  Analysis 


Zuckerman, J. V. Interest item response arrangement as it affects
discrimination between professional groups. Journal of Applied Psychology,
1952, 79-85.


Forced-Choice  Items,  Preference  Measures 


2,  3a 


Psych. Abst., 27, #499 (Rev.)


Zuckerman,  J.  V.  A note  on  "interest  item  response  arrangement".  Journal 
of  Applied  Psychology,  1953,  37(2),  94. 

Forced-Choice  Items,  Multiple  Choice  Items, 

Questionnaire  Theory  and  Development  2 




Zuckerman, J. V. The development of an affective adjective check list
for the measurement of anxiety. Journal of Consulting Psychology, 1960,
457-462.

Adjectives, Personality Measures, Scaling
Journal of Consulting Psychology, 24, p. 462


R-NA 


